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[#ff§S£©ttB] 
^tr£fi£DNA. 
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£#81" S C £ £fcf»£-r*I!#B 1 E«©SfifcDN 
A. 

[ifi^ffl 3 ] £j£D N AarFeK^-rSQIBIIjiSifeaS 
*TT-5 fe©T&3§!3#B 1 fB«t©&J$DNA. 
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l.S kb 



[»#S4] &J&.DNAifiM&mo&M#m-&f.Z 10 

a. 

nsE^jfc^rrsfcrori&si**^ i is«©^dn 

A. 

[11*816] «*Sl&V>L5Sfe©£jSDNA£-& 

[H*B8] fttWl->i'J->7 0U (E. co 20 
11). AfJVX • ■y-^l'^ (B. subtllls) 
Sfctt/WVX • :7V ex (B. b r e v 1 s ) -?<&5 

[is*© 9 ] macm 7 s iB*©«i[*«j&igift>f 

-CiSSU ^©&£&8#*fc«iglfc*#Sfchifiit!l7 
[000 1] 

SA) &3-Hr5jte : f«:#tr-&*DNA, -&(£DN 
Afc^fS^XSF, 87^X5 FKck DJiSSte&S 

[000 2] 

[ft*©&«] ffl^^.DNAfi«5oii^lcJ:-3T, *J» 
ffi^ffl»£telC43^T&^^£&a*©S£^£&St 

©fifc^tt. mRNAfcfSSLT. a»«^8MtK:«koT 40 
ffc8LfccDNA#6i'n-->£ff S^tlZioT^ 

MBS 5 8 -5 6 6 8 4lplCc DNA©PKft^H^^n 

[0 0 0 3] gegfcs- H-rs®fe?»i-t©sagro 
73 /KB*]* 1 75 /&lCO#DNA© 3fflS*>6& 

75 ;m\zMfc-rz : 6&®mi&'f' Lfc 1 ^Tlifcu. 



[0 0 0 4] ft^T, WS©^TIS8»Sn»:cDNA 
B&^e^SS^TJfeO, &-fLfc*IBK^©Ii?«3£ 
[0 0 0 5] JS^fc&SiSiir-SfcteaSfcaSSi 

cDNAlC*V>Tttae^«t'lC#&r5*ilS 
[0 006] 

[^W«ftbJ:5£-r5SllS] ±&©*tK, S&3F*R 

-t 0 2»<fc < SESifSDN AE#J£fto fcfc©T»S £ 
fct3T©fi!a±, a^^*!ll©»jflSS(i*^ iSSHCEfi;* 

«rtc£&i*U«>-5£i£©*5S£SS:J6lEU 

<b cl <t 53 ■£> o 
[0007] 

[s®&»ft-r«.fe«)©^a] m®n& 
&®ntb 5ts s*©sest?»5 1 h mut^i 3 > 
£*§f S^©ig&£^&s»£ttC:&^T£ 
lc£gT&fc*l;:. fchJfili«7JV:/3>©73/KE?iJ 
Sri-FT-SDNAE?^ 
(D73 yKEM^bS-Br^V^. 



—584— 



3 

2 xmrnzw * fcv» * 3 era. 

5. 

[0 0 0 8] ST\ cDNASfflHT^ii (E. co 
1 i) . ttSB (B. s u b t i 1 i s) 3©fS£«rC 
t hjfaiHTJV^S ^S8»3 5 8-5 

6 6 84, ftmms 8- 1 5 0 5 1 7. &WBB6 1-2 

7 5 2 2 9, #111836 2-2 1 539 3 fct'l:: 08^3*1 

»cD«6^cDNA©^ft>14»©SS:IC. *<!>g8BMl « 

lC*^T©t bMS7Jl':/5>&SS©«K69fcii;!)n 

6 2-2 9 9 8 5Ctett£©75y&E^*Sgi»S*l 
5DNABe^J-)S**S*StlTVi5^, iSEWOJS; 
KE23|«&IBflB6 2-2 9 98 5 lCiB^$nTV^73 

smgffl£\t&< ©«&&*<**. 

[0 0 0 9] #5E91#SttISCS±4&lfcjiLfc3 HXCft 
BUT, t hJtofST^S^n-H-r-SDNASxIf 

[0 0 10] fci3. ^Urf^V^Fffl&JSICtiHJ 
I^tJVS (Nu c. Acid. Res. 1 0 , 6 5 5 
3 (1 9 8 2) ) *^*7S^ (Tetra 
hedron Letters22, 1 8 59 (1 98 

i) ) t£©#ffia<-r?ic$sg£ftT£o, ^rtKoxm 

[0 0 11] »¥» ^titfc&gfcSi'l'^F^ 

[0 0 12] *ICC:©^«DNAS1S±IC*AL. H 

[0 0 13] *^ir*5ViTffl^6n5y9X5 F«# 

JBSttSp SC 1 0 1, pBR3 2 2. pUC19, p 
UC 1 8, pHSG2 9 8. pHSG2 9 9. pHSG 
3 9 8, pHSG3 9 93S£fflWl«J:V>. 

[0 0 14] *&ttlESS»*fr5«^Ktt. pUB 
110, pC194. pE 1 9 4^£fflV*fttf<fcV>. 

[0 0 15] /WPX • :7VtfX€:!f 
PHY5 0 0, pNU200 (Proc. Natl. A 
cad. S c I . USA, 8 6 , 3 5 8 9 (1 9 8 
9) ) *SfflV>ntf«t^. »0>1L.*E^5 



#H¥4 - 2 1 1 3 7 5 
[0 0 16] COi^lCbTftfcia^aADNA 

n©*s^ffl^Tfe*v». £&-e©te©:£&£ 

[0017] fc&aasfc&fSiSL g&iEu aitaifiSi. 
nfcv>a<. a#5i$m#>e>i oo^^ai-ci^. 

[0 0 18] £&&©!& flitf^Jfett, ^frrticn&tt 

< ffl v> i e. ns h p l c&9£ i o x»Ki-n« * v>. 

[0 0 19] HT, *^MS:mfi«9lC^Tm^WlCSi 
[0 0 2 0] 

[SfedSWi] 

[££»£fc FM^lCS >3H5?©*IS6] 

»&Tte££LT#&n*DNA«»fc*:7 O^SSft 
T$>*. k hMii7;l//S>te5 8 5 7$/6tT-fc£© 
Tl 7 5 5mSCD«g^*tjJ>^<t 1 b£>S-C*>0. 4>fc 
<tt>2 5*@SlC^fUUr^J5£-r5.£«*t*5. 
2*«tUTy5^5HCiffl*atJ^S^fc5©-C, 
CD2tt©DNA£-&j£-t*i&£#<&£. F 

5©TJ±*<. 8t>gS©S5»t^tT75^^>h© 
m^SfrVi. -€-CTSSE?<J©«S!&fTt>T)!)^^<*:Sr 

[0 0 2 1] £U:©lfFS*fr*t>tlC, 
(Dt Vtim7)l>7S. >©75 /SEJU^'fbS-efeVi. 

[00 2 2] (^S^^SW4SSSi<. ) 
I ®N5RSS©!ft5^<j5<lcae^T*-©«!ll®B»lSB 
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5 

i -DrnzitZo (8*&58^Xx2^$gl;:jt£ 

©^k® Ttaffl $ nt v> 5 ae^pff ^ £ & * ^ < ^ 

5 (FEBS LETTERS 5 8 , 1 34, (1 9 
75), Nucleic Acids Researc 
h 9, 6 1 0 3, (1981), Proc. Nat 
1. Acad. Sc i. USA, 7_9, 7 1, (19 8 
2) , J. Biol- C h em. 2 6 1 , 6 7 4 7, 
(1 9 8 6)). 

[0 0 2 3] *^^e>^ -iKKDNAOffi^fc^fc 

ttfc<, mRNA«fcOf£Sb£cDNA©:KSE^J£ft 
feTZZ£\z^r7$;m&m*m&lsT^Zit&& 
il^iR, gp^ Nucleic Acids Re 
search 9., 6 1 0 3 (1 9 8 1) StfPro 
c. Natl. Acad. Sc i. USA, 7 9 , 7 



4) ftM¥4-211375 

6 

1, (1 9 8 2) Z3L\Z&*\ZV1to 
[0 0 2 4] L^U ±i£(0 2^CD^:SltlC*$tlfccD 
N A 6S9f £ ft S 7 S y KE*IIC % 2 *ffi<DfflSj&&* 
ife£. -r^t>-&1^^8&JSa)SF»*^lto^mRNA^ 
£SJtLfc'b<D (Nu cleic Acids Res 
earch 9., 6 1 0 3 (1 9 8 1)) , t>5 lz>\t 
^AcDfffa^^^fcmRNA^eSJtl/fcfeO (Pr 
oc. Natl. Acad. Sci. USA, 7 9, 
7 1, (1 9 8 2) ) T$>*. 
J0 [0 0 2 5] *&Wmz>\t&m&ttZ-T&AO$im*: 
Sfflbfc. n>lfzt-*£^T75/KE^S>JSD 

[0026] -tco^aE^^n^tr^-^oiis^siigtt 
[0027] zvwimvrc&fc^o^ FxD&mmG 

[0 0 2 8] 
[SI] 
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TTT-PU* 1(0. !7X) 
TtT-Tyt 0<0.QO*) 
TTC-Ph«ao<S.13tt) 

TiCTwiaca.oasii 

TTA-Uv 0(0.00*) 
0(0.00*) 
TTCl.o 0(0. 00*) 
T*6-«« JKO.JQX) 
CTT-Ui 3C0.51*) 
CAT-OU 0(0.00*) 
CTC-Ua 1(0. I7K) 

ca<muib<2.?<*i 

CTA-Wi 3(0. SIX) 

caa-cu uo.nx) 

CaS-Cli,l9(a.2S*) 

att-iu o(o. oo*> 

AAT-A» 0(0. 00*) 

m«ni*0(o7ooK) 
MMTigg(9,aiH) 

AAG-Lyi 2(0. 34*) 

«4T»Atp KO.HJi) 
CTC-7il 1(0.17*) 
Mfr«»p33 

m-fMII(l UH) 

6Afi-6l« 4(q.68X> 



[0 0 2 9] T*S£ftbfcSS#fcJ:> *gS8T*glCfSSi 

bW?h7T>* *niC3H>a— fe-^fcfrflJDjSftJl 
^,2., 1 5 4 1 (1 9 8 3) ) B ±C©J:5JC»atA 



TCT.S«flS(2.56tt) 
TCT«Cy» 0(0.00*) 
TCC-Sir 8(1.37*) 
TCCC T ,3S(B.98*1 
TCA-Str 1(0.17*) 
TCA-«. QC0.00*) 
?C6-3«r 0(0. OOM) 
TCfi-rro 1(0. if*) 
CCT-Pro 0<O.00X) 
CCT.Ar.MCl.Itltt 
CCC-Fro 0(1.00*) 

ccc-iriioci.Tim ; 

CCA-Pro 1(0.17*) 
CBA-Arg 1(0. 1?*) 

cec-ira 0(0.00*) 

mjfcl Ttl.IOtt) 
ACT Sar 0(D.00*) 

KC-Ur 0(0.00*) 
ICA-Tbr Q<O.C0*) 
ICA-Ar« 0(0.00*) 
ICC-Tfcr 0<0.00*) 
ACC-Ari 0(0.00X1 

CCC-lIi O(O.OOX) 

see-sit am. s fur 

6GA-6lf 0(0. 00*) 
866.61*. 0(0.00*)" 

[0 0 3 0] 
[&2] 
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m-TjrU<2.05H>. 
TTCPhilOH.?!*) 

tac-t, t sn.nsm 
m-ioion.?i*) 

TAA— • I CO. IT*} 
ITC-L«»i2(2.05X) 
lAfi— • 0(0. 00*) 
CTf.L.«,l8U.07H) 
CU-IUUC1.88X) 
GTCLai 5(0. BS*) 

CTA-lo 4<0.68X) 
CAA-ClilOd.71*) 

; «T>Mi 8(0. SIX) 

. iiT-isiioa.iix) 

AACAf 7(1.19*) 
ATA- II • 1(0.17*) 
AAA.L„4H7.00*) 
ATC-H.t 6(1. 02*) 
M6'L»jI9(3.24K> 
CTT-TUlin.BB*) 
CAT.Up85U.27H) 
CTC-Tll 7(1. 19H) 

isum 

CAA-CU3T<fl.Sl*) 
CTO-f IJUftJSH? 
6AC6lt24(4.10X) 

[0 0 3 1] $T> m K^LfcE^llCfc^T, SlOtC 
fc^AAGCTTMindl I I l^tefifirFflliStf) 

tt fc#«tlTV>* Fokl£02 4><fc? fci»A LT#I D 

[0 0 3 2] Fo k I li^aBStt (0 9*15/1 3$Bg 
(±ffl&/Tffi!®) 3' «t«»-r*OT» 

cfcDjh»7;i/^5>Jtfi^ON*«ifi<T«IBfT*Sct 

[0 0 3 3] »e^#<0«^lCffiV^M(E»3tlBH i 
ndl I I, Kpnl, Sail. P s t I , X b a 
I, SphI, BamHIilL Ctlf.©BSTO«J 
ffo5ife0£03lC^Lfc. 

[0 0 3 4] DNA0fc»6a 
RlrL&DNAEJ< mi) *®4<D£5lZy7>?*> 



10 



TCT-Sir 3(0.51 K) 

fCC-C,«20(3.4lttl 
TCA*3«r 6<1.02H> 
T64-..* 0(0. 00*) 
TCC-Sir 2(0.34*) 
TGS-Tro ICO. 17*1 
CCT-?rol0U.71H) 
fiST-ftri 2(Q.3<X1,,, 

CCCPro 6(1. 02k) 

fifiMn 1(0, 1TH) 

CCA-Pro 7(1. 19X) 
CGA-Art 2(0.34*1 

CC6-Ar t 2(0. UH) 

ACT-Ttr Tfl.j9tt) 

A«T-$tr SU.02X) 

icc^Thr uuuai 

ICC-S«r 2(0.34*) 
ACJ-Tbrl2C2.Q5X> 
A6A.iril3C2.22X) 
ACS-Th* 2(0. 34X) 
AG8-*n 4(0.68*) 

6CT-6;y ^0,34*) 
GCC-ALN(2.39*) 

GC*-AI.1T»,90*) 
GCA-Cly SCI. 02*) 

CCtf-AU ?(.g,34X) 

CGC-Gly 1(0.17*) 

McftflU Applied Blosystemsit 

X2fr7Sy>f hft (Tetrahedron Lett 
ers 2_2. 1 8 5 9 (1 9 8 1) ) tCcfcO-W? 

[0035] m&Fomm 

^iftDNA02 6 o nmvy&tmzmfeLTmmz 
40 &%.Liz'&\z. i@wjftf^T*5i o o t:n*;v£ffl^ 

fc. 03. 4(C^L^«IS9^T8aoyD-/^^ 
tt* S^D 7 ^ ^ «St5SSrfr^a$7nHH/, 
T 4 U tf—tfT^y-v 3>lT^Dyi? KffiSf 
*»rfr££j££i*\ m^pUCl 8t>U<ttpUC 

1 9lC#D->fcLfc. ^a-Xbb^^Pu/^OD 
NAE^J^yf^vS (Science, 214 , 1 

2 0 5 (1 9 8 1) ) \Z£'DT'J>te<t%2mzt>ft'0 

ttlftl m gtpUC 1 8£fcl*pUC 1 9#jl z*g£ffl 
50 ^T7^y-: >3>&ff\<\ ?Uyi; 1, 2, 3 t?U 
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feWM^D-^^pUCl 8£fctepUCl 9\Z>7U 

UCl 9*91 tfg&fflW^y-: xa>£fr^ ^ 

A£#ligbfc (0 3) . 
[00 36] 

AfcifrfcLfc. ft** HB+» SDttU#y-A&<&8B 

5FpHSAStf^77>HpT13s (Nco) 
(J. Biochem. , 1 04 , 3 0 (1 9 8 8) ) 
tfr*>®6 tZOfrJ: V \Z38$tfv Hp SDHSA4 
£ftjSLfco fc*. ^7X$HpT13s (Nco) 

S«cAJ 1 2447 (FERM P-1 0 7 5 7) 

[0 0 3 7] ^S^7X$HpSDHSA40ifi 20 
O#«««TOffi0T*S. pHSAfcFokl 
tBamHITMU (^b hjfiifS 

7;l^i= >m&=?<0*&#*^tsm 1 . 8 k b Bffr) « 
MB?*. — pTl 3 s (Nco) £C 1 a I £B 
amHITMU *£^;fr<D8rtf- (t rp?D^ 
7>^y>»tt»fi**^tftt 
2. 6kb8r>W £BBBT*. £Offi^<h0 5 K^Lfc 
6«DNAt$T4U^T7-fy->'3>lTpS 
DHSA4$i«U. 

[0 0 3 8] Z(D£V iZLT&Ztlft^y X* F v SD 30 

HSA4lt trp^D^ *1U-?-<Dmm 

T, Me tSS«CjS?SMHSA^itgafeL^Sa^ 

Tt r pA^-$*-^-S:iATV^. 
[0 0 3 9] ftl:u^S^7X^HpSDHSA4T 

B 1 0 l**«JtG&U ff*Slte&#HB 101/pS 
DHSA4£#fc. £01*0 -X, BSX*^, 

KHiPO*. NHiCl, MgSO*. CaCh, fcf* 
^>Bl^trig«iT^L^. «S»ji&a4^raT>f 40 
>H-*7ir'J;i«K±*ll**Wt, «*&ifol 5P# 

[0 0 4 0] maS, 2 OmM Tris-HCl 3 
OmM NaCl 0. 5M EDTAAy 77— tCJB 
St. 0. 2 5mg/miyyf-At0tlWBlS!H 
j8«»«»bfc. 3I&£<££iSa:«, 2 OmM T 
ris-HCl 30mMNaCl 0. 5M EDT 
Arty77-\ZT&&* BtfSitU 1 OmM EDT 
AjSflfcfcKSU m&WfttVfz* 

[0 0 4 1] 07 (A) teHB 10 1, HB 1 0 1/p 50 
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s dh s a 4 <D£m&m&Rxfmmft* s d s * u 7 
^u;v75Hea»aifcHT*So 0*01, 2, 

3, MOBS^H«Toa0T*S. 
[0 0 4 2] 1. HB10l£K{fc^& 

2. HB1 0 1/pSDHSAEl 2£K#gS 

3. HBlOl/pSDHSAEl 2flHftH& 

M. fl^a^-*- 

HB 1 0 1/p SDHSA4 CD gT&HS g^OH 
Bl 0 lfcfcULfcttfc^^Sfte 7K0/t>PfttB« 

[0043] 07 (A) tftgscomaaciHg (saatt 

1/3 0) , ftHSARrnxX^^D^f-f^ 
£fTfc5£0 7 (B) CD<k^ft^^->lCftO. 

[0 0 4 4] ^fi$6M^y7-^>T^JSfl:U ^ 
mKh-MMT (final 0. 1M) 1 

It. cn*75/B3^*x>*-fc3Mt % N**8ttfi 
bserved te^&fceSSnfcffi^Jfc, Predi 

c t e d\tttLitmm**n*n7fc?o 

[0 0 4 5] SltoztfrS. *aMfcfev>TN*lifc- 

[0 0 4 6] iB6MHB 1 0 1/pS DH S A 4 
(A J 1 2 4 9 8) te> Xlgttffi&»£fcX£W$SmK 
^ffcSn-CVi* (FERM P- 1 1 2 0 8) o 

[0 0 4 7] Sfi^6My7^>T-5ISfl:S, IMy 
ftxH h-j>£i/i oaim^Ti 0 0t:2»TM 
TG&frV^ »fflHPLCK<fcoT£SUfct::5*igSI 
iC<fc^)t hJtofB7;^5>(O^Sttl 5-2 Omg/ 
L/O. DT&ofc. ^096 1 -2 7 5 2 2 9Kte, 
*BSfc:m*S&iiS£jfcfi5-l Omg/L/O. D*t 

[0048] 
[§fc»«3] 

[h HMI^Jl^/S ><D#lffi\Z&»Z#1&£m] 

PUB110 (J. Bacteriol. 134 , 31 
8 (1 9 7 8)) ^ISO^^-pBR3 2 7 (G 
ene 9_, 2 8 7 (1 9 8 0) ) ^EcoRI«8fi 

^^-pBU4 3 7 1 $»il/L PBU4 3 7 1 

(09) o 
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[0 04 9] tt£SCOa-7^7~ififi^amyE<& <D$ifr£* FpASEC 1 £No t I, Sma 

0*>, a-T$7-V<»%$lt / Amz&Wti;m\*, m \T&mhT&1t7. 5kb<OWfr££T4U#-if£ 

0. 4kb<Dmmz&tEVT#K>, *§§®/3-l ac t ffl^TH-&**fc. Z.<D&Z){ZVT%t>tl1t7y7>$. H 

a m a s e 7* 5 F p TUB 2 5 tffc hitaffi 7 >#i&:/^X £ Hp AMY3 3HS 

6 (Biochem. Biophys. Res. Com A4W (014). 

mun. 1 34 , 6 2 4, (1 9 8 6)) Tfct Z.<Dm [0 0 56] feggfclckSb: hiMiyjUr/S ><Pft» 

«*t 0 . 4kb H 1 nd I I lmfttisTnZtlZo ^^«^^«$aiCA¥L#Sttm® 1 A5 1 Ot* 

[0 0 5 0] 01 0tt* a-7a7-ifO»»lC^T (J. B a c t e r i o 1. 1 6 5, 9 3 4 (1 9 8 

*D»»^^tt«0(B*n*^y^;V^^HC!)«K^ 3) ) «±i^775 HpAMY 3 3HSA4TyD 

(A 1 a 3 3) MittOTS/KS^JXr/DNAE^J^ 10 Y7v7s h&fc£08*SmftU J£iM£&8cl A5 1 0 

UT^£. tt^O^>/^S©fie^^ftE^UIt /pAMY3 3HSA4$ifeo 

^^;^y^H«Wr^<Dlt«lcai|gr^rc«)Jw^ « [0 0 5 7] ^©<fc'5»CbT#^HI£^«;l A 5 1 0 

K^lifiJtBWfflfi^N^^e^ 7:=/«E /pAMY3 3HSA4 t3>hD-;VtLT^X5 

^J^A«>e:a:^<^--^^IS»*SSffi ; &E@U HpBU4 3 7 l £*rr*^HIfc8M*i A5 l 0/pfi 

^0DWS:^»r/ai:N*Sg^TS^frf^cfc^^75yK U4 3 7 1 ^0)^^hUyh>. BSX+X, NaC 

(*360)75yKS^^#^e>nSDNAS^J^t>tlC 4. 16, 1 8 ^rWTJ&««&tJ->^U >^U Jg*± 

pJie^fflK»*S5tt^»«^fet^5, HapUM ffi£l/z lTt?llHlR^51aI^>f D>^>^^>JCX# 

<Di£&* Al a 3 0£3-K-f*E^£GCT^£GC y h LTJfCt: Hifiifl!7;i^ >SMfr«ffl^TH y M A 

ctcesi-r^cit^cfcoTPt-oNo t iw^at a? /:/nyx^>^£frfrofc£c;5, gji 5i£^-r£5 

#*££j&<¥Bfco fc % t hifiifl!7^a>*ts^^»^LT^C:t 

[0 0 5 1]^iir, BllCi^^DNASAp jWSBSnfc* 3**, BHtC^tStandard 

plied Biosyst ems &K<DDNA'&j5£tS sit SI GMAOE ssent i alglobl in 
MUTMU 2<D£5t;:LTiHJ?#*B'^ free HUMAN A 1 b urn i n £Jl§V>fc. B 

^pASECl&«*Lfc. pASEClll Itl* r o t hOfiglClt JgifefcXtf^ HL&. 0*<D1, 

No t I tSma I t£J:0)l3M3H£T"0> 3 ' 2, 4, 5 CD&SKfcU A5 1 0/p AMY3 3HS A 

*S^¥?tft;bTN*S^jS©aS^SIS»*ET^»r 4 3, 6 OffifiKtel A 5 1 0/p BU4 3 7 1 £ 

LT££, Wt£5' WNo t I cohesiv ZrtuetlXtfy HLfc. 

eT3' *JB^«KB* EK£5£«5 ^«DNAT1 [0 0 5 8] J&efcSM* 1 A5 1 0/pAMY33HS 
ft-tZZtlZZ^X. amyEOi/^M^W 30 A4 (A J 1 2 4 9 3) 1 1 A 5 1 0/p BU4 3 7 1 

^tftjs^awMei^Bais^L^a&f^MS'r^ (a j i 2 4 9 2) nfrnwi^m^rnxsm^miz 

z.£&-v$z>±5\zfe^T^z>o ^te^nrv^c ^co?pt€#^«> iasio/pam 

fO 0 5 21 HSA#jftgg*3 F<Pffi£ Y33HSA4^FERMP-1 1 2 0 7T> 1A51 

^T01 3IC^f<fc^^2*CD^*DNA$r^SbfCo 0/pBU4 3 7 1OTERM P-1120 6T& 

CO2^<0^«DNAtHigflllT*Sbfc^^t:h S. 

jfiifS7;i/^^>ae J ?^t?y^x^ hphsa (S3 [005 9] 

#88) Stf^77^HpUCl 9tft*>, ?5X5Hp [238094] 

UC3 3HSA£«SgLfc (013). [^&^t MfafS7;i/^>»fe^C0«|^] 

[0 0 5 3] Z<D?7X$ HpUC 3 3HSA©iSO gg^OSfr 

[0 0 54] HP^, pHSASFok I tB amH I T frfcfrofco fc bM&TfrfS ><07$ /SE^JIittft 

1. 8kb»rm £PiK-r*o T^^fflB^fe^ (FEBS LETTERS _58_, 

pUC 1 9£BamHI tHindl I I TSJBrLTfc 134, (1 9 7 5) , Nucleic Acids 

< 0 ZtlZtmi 3+fc^Lfc2*<Z)'&j£DNA<h£T Research 9_, 610 3, (1 98 1), Pr 

4U#— t£T»*SU IWO^X^ HpUC3 3HS o c. Natl. Ac ad. Sc i. USA, 7_9, 7 

A&m&lsft. 1. (198 2), J.Biol. Ch em. 2 6 1 , 

[0 0 5 5] £T#K:/5*£ HpUC33HSA$S 6747, (198 6)). 
IB»*BamHnr«l31bfc«fc^l/y^aaU [0 0 6 0] H&tCDNA<De?lJ£3fcfc 
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(9) 



-211375 



15 



16 



s j mtem & v ttmzm&i< nr v- * 7 ^ j ke 

^St>#®ft^K^ ^J»rU J . Biol. Che 
m. 2 6 1, 6 7 4 7, (1 9 8 6) £ffi£Sftfc7a 



[0 0 6 2] ^0«ffiE^I^3>tfaL-^0ii5*«iStft 

[0063] com^LTcmm-'vo^ vxDBmms 

[00 64] 
«3] 



TTT-Pi« 1 

nPuIn 0 

TTG- L«u 0 

CTT-L«u 3 

CTC-Uu 1 

'C-L«u54 
T 



TTTff 



UG-Htt si urn 





TAT-Tyr 0 



tIFm* 

— „ TAG-*" 0 
00*) CAT-Ms 0 

m m wm 





CG 

CGS-in 0 
AGT-$«r 0 
AGC-Sar D 
AGA-Ari 0. 
AGG-An 0( 
GGT-Gly B( 



T7T 
.oo* 
.oo* 

.00* 
.00* 
.00*) 




[0 0 6 5] T***Lfc«#tt. *§§ST*SK:385t 

Stlfct-^XTH' >©3- KTfc* (#%3t« : fflJBX 
3*. 2_, 1 5 4 1 (1 9 8 3)). ±&<D&5\Z\Zt/v 



GGA-Gly 01 TOOT) 
QGG-Gly 0( .00*) 

> £ 3 - * X £ V >«#(&D N AS^IIC V>T m C 

[0066] 
[*4] 




TAT-TyrlZ 



TAA-*** 1 
TAG**** 0 

cn-nitii 

CAC«Blg'5( .8 ! 
CH-Glnll "T7BL 
GA G-Gln T P l»54l) 

urTTnn V.lli} 

m 



1^5*) 




TQT-Cyolfi 








TGAr*** 0 
TGG-Ttp j 






SB! 



_ AAC-Aan 6 

TTwfif 

.34*) AAG-ly»19( 3.24*) 

" J * GAT-A*p*5< 4.27*} 



GAC-Glyilt 




[0 0 6 7] ST, 01 6lC^tfeE^C^^T, §«J 

cssAAGCTTOHindi i lumm&^mm 

«&£#»nTV-£Fok I ^a2C0J:5^»ALT« 
[0 0 6 8] Fok I «^SStt0 9JfiS/l 3«S 

[0 0 6 9] ae^ft^«s«cffl^^*is»*«H i 

ndlll, Kpnl, Sail. PstI, Xba 50 



I, SphI, BamHItUfc- cn^>0»igTO« 
l»^0^a 1 7 fc5KLfco 

[0070] DNAOftf^a 
KfrbfcDNAE^J (016) SrEl 8W«t^«C7^^ 
*>MC#§fll^ Applied Biosyst em 

£*X*75 ^hft (Tetrahedron Le 
tters 2_2, 1 8 5 9 (1 9 8 1) ) fclckD* 

[00 7 1] &&r?<Dm& 

&f&LftvN a<d2 6 o nmvg&tm&mfeLTmm* 

llsKDSf^Tttl 0 0bfn^;V$rfflU 
fee 017, 1 8^Ufca^»*T8^0^Dy^lC 
t>», &^ay^^«S;t-^SBr)tOpa^7^-;V 
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B-r*KH-*t*a*» -en^spuci 8feL<«p 

UC 1 9 fc£n->ffcbfc. ?U->{kL1t&7Uy9 
ODNASE^lSyr^+vS (Science, 2 1 
4., 1 2 0 5 (1 9 8 1) ) IC<koT^<tt)2tBlJC 
bfeoTSBlfc^ &?Dy£<B»rK-£lBKbfc 0 & 
iC^fRfrffil WgtpUCl 8£fctepUCl 9*5 1 /i 
g^E^T^y— ->3>£frK 7Uy#l, 2, 3 

gJfcbfc*BM?):/n^£pUCl 8£fcttpUCl 9 
fcjrn->ffcb&. S«lc3^o+rawyn-/^tti m 
g£pUC19»l M g^ffl^T5-f >3>«rff 

KpHSAE2£ttlgbfc (017) . 
[0 0 7 2] 

ST©56Si (1) ] 

AfcffrSbfc. fcfc* f^H*. SDteU#7-A|g'&fi& 

5HpHSAE25tf^5HpT13s (Nco) 
(J. B 1 ochem. , 1 0 4 , 3 0 (1 9 8 8)) 
tfrZm 1 9 \Z7ik~r &5\Z?&fcf77s^ HpS DHS A 
E 1 2ffrSbfc« ^7X5HpTl 3 s (N c 

o) X*tt*Rl^*XjWBW»fc:«£SnTVi* 
«8I*AJ 1 2447 (FERMP- 1 0 7 5 7)* 
SPSlbfc. 

[0 0 7 3] ^0K8^iHpSDHSAE120 
BI»©»IBttKTCDaDT»*. BP*. pHSAE2£ 
Fok I tB amHITMU »*>**fcWfr O&J* 
k hMTJ^S >*e?©*«5»^ttf»j 1 . 8kb 
»rfr) £IB«T£>. pT13s (Nco) £C 1 

a I fcBamHI-CWKrU **^*©»rfr (trp^ 

**tt»2. 6kbtfftf-) E©W#£B5 
fcS Lfc^flED N A £ * T 4 U T9-f ^ 3 > 
UTpSDHSAEl2$fiiLfc. 
[0 0 7 4] £<Dcfc5lCbT«£tt£:^X£FpSD 
HSAE12I1 trpya^ © 

mkt% Me t«scdaas!HSA*tB«ajieb&ge 

[0 0 7 5] £©58^:77X5 F p S DHS AE 1 
2 J: < JH V> &tUMMs*K3>«r AJS*ffi^T;*:iB 
fHB 1 0 lft&^HCSlU JMHMfcHB 10 1/ 
pSDHSAE12^o £©tt*iOHi-X. »S 
X*X> KHiPO*. NHiCI. MgSO*. CaCl 
tf*5>Bl*£trtMlT*«bfc. ig«R!*&&4 

»1 5^WiS*bfet^5v B#l*fcBft*«*j*bTH 



L0) 4$K¥4-2 11375 

IS 

ft. 

[0 0 7 6] 2 OmM Tris-HCI 3 

OmM NaCl 0. 5M EDTAAy77-l:» 
1U 0. 2 5mg/ml Ul/^-ATO'CmWI 

a, s^8E8S»bfc. jsa&®jts«:«, 2 0mM t 

ris-HCl 30mMNaCl 0. 5M EDT 
A/^77-KTtt#, mfiSfcU lOmM EDT 

[0 0 7 7] 07 (A) 14HB10 1, HBlOl/p 
10 SDHSAE12©±B*SeRrWSttH»SSDS# 

2, 3, MOBS^^TOSDT*^. 
[0 0 7 8] 

1. HBlOl^S^ga 

2. HB10 1/pSDHSAE12*I«:Sa 

3. HB1 0 1/pSDHSAEl 2%mmft 
M. fl^SV-*- 

HB 1 0 1/pSDHSAE 1 2 <DmftW&\Z\Z* lt£ 
(OHB10 lfcWA&hfcV*4HP«»6 

20 B«e>e>n> *n»«, HttB#icia«i3ftT^*. tHfa 
[0 0 7 9] H7 (A) &ntt©«auMMfc (sasra: 

1/3 0) > JiHSAttflcT^xX^^Dy-r-f >y 
£frfc5£B7 (B) ©<fc5fcr^->fcfcD, Rtt* 

[0 0 8 0] «tt&6Mtt[»y7^^>T"5I«flSb, v 
f*XKh-MMT (final 0. 1M) 1 
A? 0 0C2»Klft. iSfflHPLCTS«tSe&«KL 

©75/KEHI£H'*fc£;i5* B8©<fc3fc, Pi^fc 
1 6 7^y»aS©^T^— StL^o fc** ^B*> O 
b s e r v e dttftB£ail£nfcB3V*. P r e d 1 
c t e dte^£bfcE*J£^n^n*vr. 
[0 0 8 1] &±©!li^6, ;*JMrfci5lf>TN*iifc 
Me t3»©f#ttbfc»TthJllfll7*^5>*«Dia* 

[0 0 8 2] ffiea»HB 1 0 1/p SDHSAE 1 
4? 2 (AJ 1 2 5 7 6)H X«a«K»t«XiI»fi(Sm 
fc«fftSnTV»* (FERM P-1 1 804) . 
[0 0 8 3] WKt6Mif7X^>T^nWfca, 1M? 

ftxw h— inti/i oasD^Ti o o < C2»ra 

5cftffV^ »«HPLClCctoT)t«bfctC5#*il 
iCi^t hifiltS7;U^>©**fittl 5-2 Omg/ 
L/O. DTfcofc. WWB6 1-27 5 2 2 9KI*. 
*»Mfc*»ft«»±d6*5-l Omg/L/O. Difi 

S£ 2 ««±±E* t>OT*^o 
[0 0 84] 
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t«6BW63 



(11) 



#ffl¥4-2 113 7 5 



[0 0 9 0] pHSAE2^FokItBamH 



pUBllO (J. Bacteriol. 1 34 , 31 pUC 1 9£B amH ItHindlll T«»rb 

8 (1 9 7 8)) ^ffiB^^-pBR3 2 7 (G T*5<o CltlS £0 2 0 ^fc^Lfc 2 *^SDN A £ 

ene 2 8 7 (1 980) ) t^EcoRlM £T 4 U tfTSJSU B«<Z>:/^7^ H pUC 3 3 

^^-pBU4 3 7 UiSbfv. pBU4 3 7 1 [0 0 9 1] ST&fc^XS H pUC 3 3 H S AE£ 

*]£®TI;i7>t^>U>fiH4, tt*«Ttt*^>f «R»*BamHIT?»31Ufcaic{rl/y^«aU * 
-»Irtt££K j rr<5 (09) . i0 V^TNo t ITSaST^diitCioT^en^l. 8k 

[0 0 8 5] ttm®<Ba-7S5-iffie^amyE<7> bOSrfrt, ^XSFpASEC 1 £No t I , Sm 

55, a - 75 t»»fcMft«»lt #5 a I Tj&3£LT#fc7. 5 k ba>«*fc?&T4 Uff— If 

0. 4kbO*«i:ff4l/T*0, *B§B0-1 ac t fcJBV>T»£*tffc. Jia)J:3fcLT»6nfc^X5 

ama BctttWr»*t*^SPpTUB2 5 F#fc h*«7*^5>»»^X3 F P AMY 3 3 H 

6 (Biochem. Blophys. Res. Com SAE2TM (02 1) . 

mun. 1 34 , 6 2 4. (1 9 8 6)) TfiL £<DS [009 2] tegBfci:* fc MajjlTjkgg ><P#» 

ttPO. 4kb Hi nd I I lSfM-tLT#enSo 3»*fc&«»afcA*L#*ttM 1 A 5 1 0* 

[0 0 8 6] 01 0«* o-75 7-ifCD^l:MT (J. Bacteriol. 1 6 5 . 9 3 4 (1 9 8 

* 0 D»Sn* ^^Jl^^ F<Bl2J0r/& 3) ) tja©^W5HpAMY33HSAE2'C^ 

(Al a3 3) «SE©75yKEWaycXDNAEJB«» # n httfcJ;D»K«»U »JME8l»l A 5 1 

UTVi ffi£<& 9>ntrK (DrnGrrZlttEmmtZ UZ 0/pAMY 3 3HS AE 2 

5/^^^H««A©*«fcW«"*fc»fctt « [0 0 9 3] £©*5lCbT»&»HCaH*l AS 1 0 

Kjflioiatt t a«»K?©N5fc*®iMHc, 75 / KE /pAMY33HSAE2 t3 > ba-jvthx^yX 

£ tft<on-*ft«IH»»»tt«E«U 5FPBU43 7 1 £*T«ttME*Mft 1 A 5 1 0/p 

^©B£«KjS«!:N*S$TS035:^<J;5^7$;S BU437 1 t©i^h'jyh>> ##X*X> Na 

E3«t3-HT*^rtDNAT3«Brr*tJ:^. SJSrA CI, >£^tf«ifiT 3 7 "CTSSigftLfco 1 

f*5E«D7Sy»EJ«*6#JL6ftSDNAE5!l*t)tfc 4, 16, 1 8«fWT»««**>^U >3^U ««± 

^HBftMB»*«Mt*«*^fct^^» Hap I I88& tS£ 1 M 1 f ^ 1 ®Rt*5 @:M n>/ >^9>ICX#" 
<0i£&, A 1 a 3 0 S3 - FTSE#J£G C T*6GC bTtt t hMS7JV:/^ >£i#£ffl F^MA 

Clc«l»1-*^tKJ:^Tit-ONo t £0 /^Dyf'f >y*fTfcr>fcfcC5, HI SKSTiS 

SSCfcj&tflofc. ^ k Hffl7JW5>«t«»l:»8tlSLt^fc:fc 

[0 0 8 7] *>£T> 01 lOJ^ft^rtDNAftAp aUWBSftfc. ft** BH*fc*V>TS tandard 

plied Biosyst ems ttficDDNA'njStl s te, SI GMACOE ssentialgloblin 
MWTIMU &lC012 0<fc5fcLTi&ffl#»^ free HUMAN A 1 bum! ntHWfc. B 

^-pASECl$«St^ pASEClll HtlS rothOfiBim &M&7>#y b Lfco 0*0)1, 

No t I tSma It«»U ffit©Btt»firF©3 f 2, 4, 6 OttBfctt 1 A 6 1 0/p AMY 3 3 HS A 

*«S^»ftUTN5^#ffi©aS&«IIS»iRE-C«» E2S, 3, 6 ©ttSKtt 1 A 5 1 0 /p BU 4 3 7 1 

UT*5#, Mt$5' sMtfNo t I c ohe s i v S^n^tlT^y Mfc. 

eT3 B 5(£«3Jt*IIB»*Efcft5J:3a:-&*DNATa [0 0 9 4] MICSM* 1 A 5 1 0/p AMY 3 3 H S 

ISTSEilCkoT. amyEOv'WK^HW 40 AE2 (A J 1 2 5 7 8) t 1 A 5 1 0/p BU 4 3 7 

At«*oBW«at«w»aitufc*fiHP*:«i*-r* 1 (a j i 2 4 9 2) & x3Ra«Ka*»x«8^9f 

Cij^*S*3fc:ttt>TV>*. l:»KSntt^. *©»Ktf It lA5 10/pA 

ro 0 8 81 HSA^7X5 KOgjjg MY3'3HSAE2#FERM P- 1 1 8 0 6T, 1 

*fH2 0fc*r±5*2*©^J«DNAtfMlb&. A 5 1 0/pBU4 3 7 l^FERM P-11206 

Jll*7;^5>il-B?ftfttr^9X5 HpHSAE 2 [0 0 9 5] 

(H17#ID *tf^9X3HpUCl 9t^6, :/9 KIW7] [ftfrjfth: hJfr«7;V?3>»^®;*:lll 

XSKPUC3 3HSAE£«fgLfc (02 0) . MT©5£S (2) ] 
[0 0 8 9] HpUC 3 3HSAEO«8 *»MTOt>5 1 r>©5B8l^9X§ F£02 2CD£5t:: 
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21 

HpSDHASE 1 20 t r pA*-^ 

0. 3kb BamHI-HincI I»fr$:pHSG 



22 

nTVi^> (FERM P-1 1 8 0 5). 
[0 09 9] 



2 9 9<Z>BamH I -Hi nc I ItM M3M8U p [589K&SS*] Ig(46mTfflV^3H>^lt 

KT 9 1 fcA^r-So ^KpSDHASE 1 2©trp *<fc?l::LTx1f-f >bfc-&j£DN A£tf^TB«£-r 

z/n^e-^-^trs o bp EcoRi-ciaiw ztbfam7)i>zr*>*&m2#z*&yi\t. cDNA 

1 a I -BamH I Bffri:SpKT9 1 <DE c o R I - ©*tt©^3S4S«*jEU» ^D^W^iaStl^ 

BamHIWhtaaU BWO^XS F pKT9 fr3±T«»T*»ti:'b«!>'C**. 

lHSAE4Sffco [0100] 
[0 0 9 6] ^l:u©^7X$ HPKT9 1HSA 10 VS&OQ E*»J#*t : 1 

E4-C«#J:<ffl^6tl**<WVK^ T >Att*ffl^T* ffi#I<Z>g3 : 1 7 8 1 

igHB i o i ttevmatu mmm&WHB i o 1 s^osi : 

/pKT9 1HSAE4t»ft. d<B«c££J&#J 5 £H ftoft : 

[0097] mfcm 5 &:n«iciRtt«nnu H«fct E^io^a 

Kltoffl7;V^5>C0Sa«:fTofetc:^, £*£gte8 0 »»ft*TE^: cleavage-site 

-9 0mg/L/O. DTfctK £««5<D£}d«£3 SHfiEttK : 2 1 . .26 

[0 0 9 8] HB 1 0 1/PKT9 1HSAE4 (A J 20 E9V 

1 2 5 7 7) 11 Xlitt«B^4fixjKmaric*tt$ [0101] 
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10 20 
5' AA GCTTGGGATG GAC GCT CAC AAA 
Asp Ala His Lys 

60 60 70 

GAC CTG GGT €AG CAA AAC TTC AAA 
Asp Leu Gly Gin Gin Ass Phe Lys 



30 40 
TCC CAA GTT OCG CAC CGT TTT AAA 
Ser Glu Yal Ala His Arg Phe Lys 

80 90 
(JOG CTG GTT CIG ATC GCT TTC GCT 
Ala Leu ?al Leu lie Ala Phe Ala 



100 110 120 130 140 

CAG TAC CTT CAG CAG TGC CCG TTC GAG GAC CAC GTT AAA CTG CTA AAC 
Gin Tyr Leu Gin Gin Cys Pro Phe Glu Asp His Yd Lys Leu Val Asn 

150 160 170 180 190 

GAA GTA ACC GM TTC GCT AAA ACC TGC CTA GCT GAC GAA TCT GCA GAA 
Glu Val Thr Glu Phe Ala Lys Thr Cys Tal Ala Asp Glu Ser Ala Glu 

200 210 220 230 240 

AAC TGC GAC AAA TCC CTG CAC ACC CTG TTC GGT GAC AAA CTG TGC ACT 
Aso Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr 

250 260 270 280 

CTT GCC ACC CTC CGC GAA ACC TAC CCT GAA ATC GCT CAC TGC TGC GCT 
Val Ala Thr Leu Arg Glu Thr Tyr Gly Glu let Ala Asp Cys Cys Ala 

290 ' BOO 310 320 330 

AAA CAG GAA CCG GAA CGC AAC GAA TGC TTC CTT CAG CAC AAA GAC CAC 
Lys Gin Glu Pro Glu Arg Asn Glu Cys Phe Leu Gin His Lys Asp Asp 

340 350 360 370 380 

AAC CCG AAC CTG COG CGC CTG GTT CGT CCG GAA GTC GAC GTA ATG TGC 
Asn Pro Asn Leu Pro Arg Leu Tal Arg Pro Glu Tal Asp Tal Bet Cys 

390 400 410 420 430 

ACC GCA TTC CAC GAC AAC GAA GAA ACC TTC CTG AAA AAA TAC CTG TAC 
Thr Ala Phe lis Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr 

440 450 460 470 480 

GAA ATC GCA CGC CGT CAC CCG TAC TTC TAC GCA CCG GAA CTG CTG TTC 
Glu He Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe 



[0 10 2] 
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(14) 1 137 5 

25 26 

490 500 510 520 

TTC OCT AAA CGT TAC AAA GCA GCT TTC ACT CAA TGC TGC CAG GCG 
Phe Ala Lys Arg Tyr Lys Ala Ala Phe Thr Clu Cys Cys Gin Ala 

530. 540 550 560 570 

GCT GAC AAA GCG GCA TGC CTG CTC CCG AAA CTG GAC GAA CTG CGT GAC 
Ala Asp Lys Ala Ala Cys Lea Leu Pro Lys Leu Asp Glu Leu Arg Asp 

580 590 600 610 620 

GAA GGT AAG GCG TCT TCT GCA AAA CAG CGT CTG AAA TGC GCT TCT CTC 
Glu Gly Lys Ala Ser Ser Ala Lys Glu Arg Leu Lys Cys Ala Sex Leu 

630 640 650 660 

CAG AAA TTC GGT GAA CGT GCA TTC AAA GCG TGG GCA GTT GCG (XC CTG 
Gin Lys Pbe Gly Glu Arg Ala Phe Lys Ala Trp Ala fal Ala Arg Leu 

670 680 690 700 710 

TOC CAG CGC TTC CCG AAA OCA GAA TTC GCA CAA GTG TCT AAA CTG GTT 
Ser Gin Arg Phe Pro Lys Ala Glu Phe Ala Glu ?el Ser Lys Leu Tal 

720 730 740 750 760 

ACT GAC CTG ACC AAA GTT CAC ACC GAA TGC TGC CAC GGC GAC CTT CTA 

Thr Asp Leo Thr Lys Tal His Thr Glu Cys Cys Els Gly Asp Leu Leo 

770 780 790 800 810 

GAG TGC GCA GAC GAC CGT GCG GAC CTG GCG AAA TAC ATC TGC GAA AAC 
Glu Cys* Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr He Cys Glu Asn 

820 830 840 850 860 

CAG GAC TCC ATC TCT TCT AAA CTG AAA GAA TGC TGC GAA AAA CCG CTG 
Gin Asp Ser lie Ser Ser Lys Leu Lys Glu Cys Cys Glu Lys Pro Leu 

870 880 890 900 

CTG GAA AAA TCT CAC TGC ATC GCA GAA GTA GAA AAC GAC GAA ATG CCG 
Leu Glu Lys Ser Bis Cys lie Ala Glu Val Glu Asn Asp Glu Met Pro 

910 920 930 940 950 

GCG GAT CTG CCG TCT CTG GCG GCT GAC TTC GTT GAA TCA AAA GAC GTG 
Ala Asp Leu Pro Ser Leu Ala Ala Asp Phe Tal Glu Ser Lys Asp Val 
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2T 28 

960 970 980 990 1000 

TGC AAA AAC TAC GCA GAA GCA AAA GAC GTA TTC CTA GGT ATG TTC CTG 

Cys Lys Asn Tyr Ala Glu Ala Lys Asp Val Phe Leo Gly let Phe Leu . 

1010 1020 1030 1040 . 1050 

TAC GAA TAC GCT CCT CGA CAC CCG GAC TAC TCT CTG GTT CTG CTG CTG 
Tyr Glu Tyr Ala Arg Arg His Pro Asp Tyr Ser Val Val Leu Leu leu 

1060 1070 1080 1090 1100 

CGC CTG GCA AAA. ACC TAC GAA ACT ACC CTG GAA AAA TGC TGC GCA GCG 

Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys Cys Ala Ala 

1110 1120 1130 1140 1150 

GCT GAC CCA CAC GAA TGC TAC GCA AAA GTG TTC GAC GAA TTC AAA CCG CTG 
Ala Asp Pro Bis Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leo 

1160 1170 1180 1190 1200 

GTT GAA GAA COG CAC AAC CTG ATC AAA CAG AAC TGC GAA CTG TTC AAA 
Val Glo Glu Pro Gla Asn Leu lie Lys Gin Asn Cys Glu Leu Phe Lys 

1210 1220 1230 1240 

CAG CTG GCT Gkk TAC AAA TTC CAG AAC GCT CTG CTG GTT CGC TAC ACC 
Gin Leu Gly Glu Tyr Lys Phe Gin Asn Ala Leu Leu Val Arg Tyr Thr 

1250 . 1260 1270 1280 1290 

AAA A AG GTA CCG CAG GT6 TCT ACT CCG ACC CTG GTG GAA GTA TCC CCT 
Lys Lys Val Pro Gin Yal Ser Thr Pro Thr Leu Val Glu Val Ser Arg 

1300 1310 1320 1330 1340 

AAC CTG GGT AAA GTT GGC TCT AAA TGC TGC AAA CAC CCG GAA GCG AAA 
Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His Pro Glu Ala Lys 

1350 1360 1370 1380 1390 

CGT ATG CCG TGC GCG GAA GAC TAC CTG TCC GTG GTG CTG AAC CAG CTG 
Arg let Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gin Leu 

1400 1410 1420 1430 1440 

TGC GTT CTG CAC GAA AAA ACC CCG GTT TCT GAC CGT GTA ACT AAA TGC 
Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys 

[0104] 
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29 30 

1450 1460 1470 1480 

TCC ACC GAA TCT CTC GTT AAC CCC CCT CCG TGC TTC TCC OCT CTA GAG 
Cys Thr Glu Ser Leo Val Asa Arg Arg Pro Cys Phe Ser Ala Leu Gin 

1490 1500 1510 1520 1530 

GTT GAC GAA ACC TAG CTA CCG AAA GAA TIC AAC GCA GAA ACC TTC ACT 
Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala Glu Thr Phe Thr 

1540 1550 1560 1570 1580 

TTC CAC GOG GAC ATC TGC AOC CTG TCC GAA AAA GAA CGC CAG ATC AAA 
Pbe His Ala Asp lie Cys Thr Leu Ser Gin Lys Glu Arg Gin He Lys 

1590 1600 1610 1620 1630 

AAA CAG ACC GCT CTG GTG GAA CTG GTA AAA CAC AAA CCG AAA GCA ACC 
Lys G1d Thr Ala Leu Val Glu Leu Val Lys Bis Lys Pro Lys Ala Thr 

. 1640 1650 1660 1670 1680 

AAA GAA CAA CTG AAA CCG GTG ATG GAC GAC TTC GCA GCT TTC CTA GAA 
Lys Glu Gin Leu Lys Ala Val let Asp Asp Phe Ala Ala Phe Val 61o 

1690 1700 1710 1720 

AAA TGC TCC AAA GCT CAC CAC AAA CAA ACC TCC TTC CCT GAA CAA GCT 

Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly 

1730 „ 1740 1750 1760 1770 

AAA AAA CTG GTA GCT GCG TCT CAC GCT GCA CTG CGC CTG TAAIGATAGG 
Lys Lys Lea Val Ala Ala Ser Gin Ala Ala Leu Gly Leo 

1T80 
ATCC 3' 



ftM¥4-2 1 137 5 



[0 10 5] 6a*»J#*f : 2 
E2*J0>gS : 1 7 8 1 

m<D& : 



*JfK£STfB# : cleavage-site 
: 2 1 . . 2 6 

30 E#J 

[0106] 
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31 32 

W9UL (£7U«^2> 

10 20 30 40 

5' AA GCTTGGCATG CAC GCT CAC AAA TCC CAA GTT GCG CAC CCT TTT AAA 
Asp Ala Els Lys Ser Glu Val Ala His Arg Phe Lys 

50 60 70 80 90 

GAC CTG €CT GAG CAA AAC TTC AAA CCC CTG CTT CTC ATC OCT TTC OCT 

Asp Leu Gly Glu Glu Asa Phe Lys Ala Leu Tal Leu He Ala Pbe Ala 

100 110 120 130 140 

CAG TAC CTT CAG CAG TGC CCG TTC GA6 GAC CAC GTT AAA CTG OTA AAC 
Gin Tjt Leu Gin Gin Cys Pro Phe Glu Asp Bis Tal Lys Leu Val Asa 

150 160 170 180 190 

GAA GTA ACC GAA TTC GCT AAA ACC TGC GTA GCT GAC GAA TCT GCA GAA 
Glu Tal Tfar Glu Pbe Ala Lys Thr Cys Tal Ala Asp Glu Ser Ala Glu 

200 210 220 280 240 

AAC TGC GAC AAA TCC CTG CAC ACC CTG TTC GOT GAC AAA CTG TGC ACT 
Asa Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr 

250 260 270 280 

GTT GCG ACC CTG CGC GAA ACC TAC GOT GAA ATG GCT GAC TGC TGC GCT 
Tal Ala Thr Leu Arg Glu Thr Tyr Gly Glu let Ala Asp Cys Cys Ala 

290 * 300 310 320 330 

AAA CAG GAA CCG GAA QQO AAC GAA TGC TTC CTT CAG CAC AAA GAC GAC 
Lys Gin Glu Pro Glu Arg Asa Glu Cys Phe Leu Gin His Lys Asp Asp 

340 350 360 370 380 

AAC CCG AAC CTG CCG CCC CTG GTT CGT CCG GAA GTC GAC GTA ATG TGC 
Asd Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Tal Aap Tal Met Cys 

390 400 410 420 430 

ACC OCA TTC CAC GAC AAC GAA GAA ACC TTC CTG AAA AAA TAC CTG TAC 
Thr Ala Phe- His Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr 

440 450 460 470 480 

GAA ATC GCA CGC COT CAC CCC TAC TTC TAC GCA CCG GAA CIO CTG TTC 
Glu lie Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leo Phe 
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33 34 

490 500 510 520 

TTC GCT AAA CGT TAC AAA GCA CCT TTC ACT GAA TGC TGC CAG GCG 
Phe Ala lys Arg Tyr Lys Ala Ala Pbe Thr Glu Cys Cys Gin Ala 



530 540 550 560 570 

CCT GAC AAA GCG GCA TGC CTC CTG CCG AAA CTG GAC GAA CTG CGT GAC 
Ala Asp Lys Ala Ala Cys Leu Leu Pro Lys Leu Asp Glu Leu Arg Asp 

580 590 600 610 620 

GM GGT AAG GCG TCT TCT Ga AAA ac CGT aG AAA TGC GCT TCT CTC 
Glu Gly Lys Ala Ser Ser Ala Lys Gin Arg Leu Lys Cys Ala Ser Leu 

630 640 650 660 

CAGAAATTCGGTGAACGTGCATTCAAAGCGTGGGCAGnGCGCGCCTG 
Gin Lys Phe Gly Glu Arg Ala Phe Lya Ala Trp Ala Val Ala Arg Leu 

670 . 680 690 700 710 

TCC CAG CGC TTC CCG AAA GCA GAA TTC GCA GAA GTG TCT AAA CTG GTT 
Ser Gin Arg Pbe Pro Lys Ala Glu Phe Ala Glu Tal Ser Lys Leu Val 

720 730 740 750 760 

ACT GAC CTG ACC AAA GTT CAC ACC GAA TGC TGC CAC GGC GAC CTT CTA 

Thr Asp Leu Thr Lys Tal His Thr Glu Cys Cys His Gly Asp Leu Leu 

770 780 790 800 810 

GAG TGC GCA GAC GAC CGT GCG GAC CTG GCG AAA TAC ATC TGC GAA AAC 
Glu Cys Ala Asp Asp Arg Ala Asp Leu Ala Lys Tyr He C/s Glu Aan 





820 




830 




840 




850 ■ 




860 


CAG 


GAC TCC 


ATC 


TCT TCT AAA 


CTG 


AAA GAA 


TGC 


TGC GAA 


AAA 


CCG CTG 


Gin 


Asp Ser 


lie 


Ser Ser Lys 


Leu 


Lys Glu 


Cys 


Cys Glu 


Lys 


Pro Leu 




870 




880 




8S0 




900 






CTG 


GAA AAA 


TCT 


CAC TGC ATC 


GCA 


GAA GTA 


GAA 


AAC GAC 


GAA 


ATGCCG 


Leu 


Glu Lys 


Ser 


His Cys lie 


Ala 


Glu Val 


Glu 


Asn Asp 


Glu 


let Pro 



910 920 930 940 960 

GCG GAT CTG CCG TCT CTG GCG GCT GAC TTC GTT GAA TCA AAA GAC GTG 
Ala Asp Leu Pro Ser Leu Ala Ala Asp Pbe Val Glu Sex Lys Asp Val 
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35 36 
960 970 980 990 1000 

TOC AAA AAC TAC CCA 6AA GCA AAA CAC GIA TIC CTA GGT ATG TTC CTG 
Cys lys Asa Tyr Ala 6iu Ala Lys Asp Tal Phe leu Cly let Phe Leu 



-211375 



1010 1020 1030 1040 1050 



TAC 


GAA TAC GCT 


CGT CGA CAC 


CCG GAC TAC 


TCT 


GTG CTT CTC 


CTG CTG 


Tyr 


Glu Tyr Ala 


Arg Arg His 


Pro Asp Tyr 


Ser 


Val Tal Leu 


Leu Leu 




1060 


1070 


1080 




1090 


1100 


mr 


nr iy»i in 


l/W f|A Oil 

ACC TAC GAA 


ACT ACC CTG 


GAA 


AAA TGC TGC 


GCA GCG 


Arg 


Leu Ala lys 


Thr Tyr Glu 


Thr Thr Leu 


Glu 


Lys Cys Cys 


Ala Ala 




1110 


1120 


1130 




1140 


1150 


GCT 


GAC CCA CAC 


GAA TGC TAC 


CCA AAA GTG 


TTC 


GAC GAA TTC 


AAA CCG CTG 


Ala 


Asp Pro His 


Glu Cys Tyr 


Ala Lys f el 


Phe 


Asp Glu Phe 


Lys Pro Leu 




1160 


1170 


1180 




1190 


1200 


cn 


GAA GAA COG 


CAG AAC CTG 


ATC AAA CAG 


AAC 


TGC GAA CTG 


TTC GAA 


Val 


Glu Glu Pro 


Gin Ash Leu 


He Lys Glu 


Asn 


Cys Glu Leu 


Phe Glu 




1210 


1220 


1230 




1240 




CAG 


CTG GCT GAA 


TAC AAA TIC 


CAG AAC GCT 


CTG 


CTG GTT CGC 


TAC ACC 


Gin 


Leu Gly Glu 


Tyr lys Phe 


Gin Asn Ala 


Leu 


Leu Val Arg 


Tyr Thr 



1260 * 1260 1270 1280 1290 

AAA AAG GTA CCG CAG GTG TCT ACT CCG ACC CTG GTG GAA CTA TCC CGT 
Lys Lys Tal Pro Glo Tal Ser Thr Pro Thr Leu Tal Glu Tal Scr Arg 

1300 1310 1320 1330 1340 

AAC CTG GGT AAA GTT CGC TCT AAA TGC TGC AAA CAC GCG GAA GCG AAA 
Asn Leu Gly Lys Tal Gly Ser Lys Cys Cys Lys His Pro CU Ala Lys 

1350 1360 1370 1380 1390 

CGT ATG CCG TGC GCG GAA GAC TAC CTG TCC GTG GTG CTG AAC CAG CTG 
Arg Bet Pro Cys Ala Glu Asp Tyr Leu Ser Tal Tal Leu Asn Gin Leu 

1400 1410 1420 1430 1440 

TGC GTT CTG CAC GAA AAA ACC CCG GTT TCT GAC CGT GTA ACT AAA TGC 
Cys Tal Leu Bis Glu Lys Thr Pro Tal Sex Asp Arg Tal Thr Lys Cys 
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37 38 
1450 1460 1470 1480 

TGC MX GAA TCT CTG 6TT AAC OGC CGT COG TGC TO TX GCI CTA GAG 
Cys Thr Glu Ser Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu 

1490 1500 1510 1520 1530 

GTT GAG GAA ACC TAC CTA OCC AAA GAA TTC AAC GCA GAA ACC TTC ACT 
Val Asp Glu Thr Tyr Val Pro Lys Glu Phe Asn Ala Glu Thr Pbc Thr 

1540 1550 1560 1570 15S0 

TTC CAC GCG GAC ATC TGC ACC CTG TCC GAA AAA GAA CGC CAG ATC AAA 
Phe Eis Ala Asp lie Cys Thr Leu Ser Glu Lys Glu Arg Gin He Lys 

1590 1600 1810 1620 1630 

AAA CAG ACC GCT CTG GTG GAA CTG GTA AAA CAC AAA CCG AAA GCA ACC 
Lys Gin Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr 

1640 1650 1660 1670 1660 

AAA GAA GAA CTG AAA GCG CTG ATG GAC GAC TTC GCA GCT TTC GTA GAA 
Lys GU Gin Leu Lys Ala Val let Asp Asp Phe Ala Ala Phe Yal Glu 

1690 1700 1710 1720 

AAA TGC TGC AAA GCT GAC GAC AAA GAA ACC TGC TTC GCT GAA GAA GGT 
Lys Cys Cys Lys Ala Asp Asp Lys Glu Thr Cys Pbc Ala Glu Glu Gly 

1730 v 1740 1750 1760 1770 

AAA AAA CTG GTA GCT GCG TCT CAG GCT GCA CTG GGC CTG TAATGATAGG 
Lys Lys Leu Yal Ala Ala Ser Cla Ala Ala Leu Gly Leu 

1780 
ATCC 3* 
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[01] 



* AACCTTGOGA- 
CAAAACTTGA 
OAGGACCACG 
CAATCTCCAC 
CTTCCGACCC 
CAACOC AACG 
CCTCCCGAAG 
AAAT ACCTCT 
TTCOCTAAAC 
TOCCTCCTGC 
CCTCTOAA AT 
OCCCGCCTGT 
CACCTQACCA 
CCTGCGCACC 
O**TGCT0CG 
CAAATGCCOC 
AAAAACTACC 
CG AC ACCCCO 
CTOOAAAAAT 
TTCAAACCGC 
CAGCTGGCTG 
CACGYGTCTA 
TCCTGGAAAC 
CTCAACCAGC 
TGCACCCAAT 
TACCTACCCA 
TCCGAAA AA 6 
C CG A A AG C A A 
AAATGCTOCA 
CCTGCGTCTC 



TOGaCGCTCA, 
AACCCCTCCT 
TTAAaCTCGT 

aaaactgcca 
tgcgcgaaac 
aatgcttcct 
tccacgtaat 
acc aaatccc 
ottac aaacc 

CGAAACTCGA 
CCCCTTCTCT 
CCCACCCCTT 
AACTTC ACAC 
TGCCCAAAT A 
AAA AAC-CQCT 
CCGATC^TCCC 
CAGAAGC AAA 
ACTACTCTCT 
GGTGGCGAOe 
TGGTtGAAGA 
AATACAA ATT 
CTCCCACCCT 
-ACCCCOAACC 
TCTOCGTTCT 
CTCT0OTTAA 
AACAATTCAA 
AACGCCACAT 

ccaaagaaca 
aagctgacca 
aggctggagt 



CAA ATCCGAA CTTOCGCACC CTTTTAAAGA CCT6GOTOAG 
TCTOATCCCT' TTCCCTC ACT ACCTTCAGC* OTGCCCOTTC 
AAACGAAGTA ACCOAATTCO 1 CTAAAACCTG CCTAGCTCAC 
CAAATCCCTC CACACCCTCT TCOOTOACAA ACTGTCCACT 
CTACOGTGAA ATCCCTO ACT ' 0C TGCOCTAA ACACOAACCC 
TC ACC AC AAA CACOACAACC CGAACCTOCC CCOCCTCGTT 
GTOGAGCGCA TTCCACC ACA ACCAAGAAAC CTTCCTGAAA 
ACGCCOTCAC CCGT ACTTCT ACOCACCCGA ACTCCTOTTC 
AGCTTTCACT OAAT06TGCC AGGCOGCTGA CaAAGCGGCA 
COAACTGCGT. CACGAAOGTA AGCCGTCTTC TGCAAAACA6 
CCACAAATTC CCTOAACGTC CATTCAAAGC GTGGGCAG^t 
CCCCAAAGCA CAATTCGCAO AA6TGTCTAA ACTGGTTACT 
C6A ATOCTGC CACCGCCACC TTCTAGAGTG CCCACACOAC 
CATCTCCGAA AACCAOOACT CCATCTGTTC TAAACTGAAA 
OCTCGAAAAA TCTCA-CTGCA TCCCACAAOT AflAAAACGAC 
CTCTCTCGCG CCTOACT-TCC TTCAATCAAA" AGaCCTGTCC 
AOACGTAtTG CTAGOTaTGT TCCTCTACCA ATACOCTCGT 
CGTTCTGCTO CTOCCCCTOG CAAAAACCTA CGAAACTACC 
OOCTCACCCA CACCAATCCT ACGCAAAAGT CTTCGACGAA 
ACCGCAOAAC CTOATCAAAC AGAACTCCOA ACTGTTCAAA 
CCAOAACGCT CTOCTGOTTC GCTACACCAA AAAGGTACCC 
GCTGOAAGTA TGCCCTAACC tOCGTAAACT TGCCTCTAAA 
OAA ACGTATG CCCTOCCCGG-AAGACTACCT 0TCCGTGGT6 
OC ACC A AAA A ACCCCCCTTT CTGACCOTGT AACTAAATCG 
CCCCCCTCCO TOCTTCTCCO CTCTAOAOGT TCACGAAACC 
COCACAAACC TTCACTTTCC ACCCGCACAT CTCCACCCTG 
CAAAAAACAG ACCGCTCTCG TOCAACTCCT ^*^A6AAA 
ACTC AAAGCO OTOATOCACC aCTTCOCAGC TTTCtTAGAA 
C AAA p AAACC TCCTTCGCTG AACAAGGTAA AAAACT6GTA 
COCCCTOTAA TGATACC ATC C V 
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l 

AipMiHiiL7iS<rCluVt IAlaMUAr|Ph<LMA8pliuClyClu 
AAG CTTGGG ATG G AC G CTC A CjAA AT C CGAACTTG CCC AC CGTTTTA AAC AC C TG GGTG AC 
TTCGAACCCTA C CTGCC AGTGTTTAjGG CTTC AACCCCTGGC AAAATTTC TGG ACCC ACTC 
Fokl'TSl 9 tt 6 9 tt ' 



[S3] 



[05] 



100 200 300 400 500 



SD 



Met Axe AU His 



Kpnl 



PslI 



Xbal 
SpW 



» J CGATTAGTAAGGAGGTTTAAAATGGACGCTCAC 

TAATCATTCCTCCAAATTTTACCTGCGAGTGTTTA 



Cli I eohoiva 



Fok t cohesive 




pHSA 



[07] 



(A) (B] 
M123MM12-3M 



II 1] 



94K 
67 K 
43 K 



30K == — 



20K — 



Noli Sae 1 Sua I Ecoft I 

5 * CGGC jg GCCC 'C GC AC C T| C C C|C C g |a AT T C 

CCCCGC|CGCCfrCGAGGGjcCCTTAA|GACCT 
Htpll cohMlvt S«H 
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[04] 



AGCTTCGCATCCAttCTtACJUATOGMOTTCC^^ 
ACCCTACaCCCAGTCmAOiCTTCMCGCra^ 

wtfrrcwACcccT(OTTctwTC(STnccck^ 

TOXCACCAAUnACCDAAACCGACTUTCeUCTCCTtACCCOCAAIJCT 

CCACtrrTAAACTCGTAAACCAAGT AACCWATKCCTAAAACCTCCCTACCTCACCAATCTCCA 

TTCACCATntCnCATTWCnA^CCATTTTWACCCATCCACreCTTAG Pill 



Pjtt 

* ■ CAAMCTTCGACAAATCCCTtC^CACCCTCTTtCCT GACAAAC TTr^ 

A U 'J t i l l l (, A Ut I0 IT rAG6UCCTCT«CACAACCCACTCTTTCACACCTCACAACTCT 

CAAACCTACGtrrGAAATCCCTCACTCCTCCCCTAAACACGAAtC 
CKACCCCCrntCATCCCACTTTACCCACTOACCAMCCAl MGILtJ IbCCCTTGCCTTCCTT 

TOCCAC/AAacCACAACCCCAACtTOCCCCCCaCCTTCCTCCCaAC u\t 
ACCAACCAACTCC1 



SUI 

TCCACCTUTrreCACCCCAnCCACWWACC^AAACCTTCCTCAAAAAATAC 

C CAHACACCTCC CCTAACCTCCTCrrcnTCTTTCC AACt ACTTTTTTATCC ACATGCT 

CTCTACUUKCaCCCCCTCACCCCTACneTACDCACCCCUCTCCTCTTm 
UACC^CCGCAJntCCCAT^WTCaTTlCCmCACCAPiA^ 

CTAAACCnACAAACCA^TnCACTGAATCCTCCCAGCCCCCT^ 

ATCmCCTCCAAACT(WAC6A«^CCCCaACTCmttCC S»bl 



CCTCnCCCIUAJCTCACCAACTCCCTCAKAACCTAACCC C TCT fCr CC AAM 

CTACccAceucoccmcAcnt^ACGuaccTTCCAnccccAaACA Ccn i ai cccAS - 

CACCCTCTCAAATCfCCTTnaCCACAAAnCCCTt^COT 

ACmACCCCAAGACACGTCmAAKCACTTCUCGTAACTnCCCACCCCTC 

CTftGCAGTTtCCCCtC7CT(mGttmcmAAAGCAW 

UCCCCttWCACCCTCaCAAttCCnTCCTmAAtmCTTCACAarrTC 

CTaAAAncmACRAaTCACCAAACmACACCCAATCaCCCAaZCCACCTr - Ibit 
ACCAATWCTCC ACTOTTTTCAAirrCTCCCTTACCA CGCTCCCGCTCCMCArC 

Xhil 

CTAGAtrrctt^GCACCCTGIXCACCTGGCC^ 

TCACCCCTCTCCICCCACCCCTCCACCCCmATCTACACOTTTTCCT 



CTCTTCTA AACTGAAAGA atcctccc aaaaa ccc ctcctc ga aaaatctc actgca it gcagaa 

nCACTTTCTTACCACCCTTrrTCGCCACCACCTnTrACACnU 

CT A GAAAAC CACGAAATGCCCt CCG ATCTCCCGTCTCTGGC CCCTCACTTOCTTCA ATCAAAACAC ' 

" CCTCq^ACGCCCCCaACACGCCACACACCCCCCACTCAACCAACnA l t 1 1 it i ii CAacen 

CTGTGCAMAACTACG£AGAAGCAAAA4UCG7ATTCCTAGCT^ U 1 1 

TTTt ATCCCTCTTCCn TTCTCCATAACCATCC ATAC AAGGACATCCTC A TCCCAC CAGCT 



®4B 



CTCCAAAAATOCJCgCCAC^ 

TTe* A *CCCCTCC I TCAA ? ^gCg H CA 4 CCTO^ 

CAOCTCCCTCAAJA.CA.A^ 

„ rt4 eceIS8I8?Xm^«SI?«iSn«««E4*ni«SSI?KIISIIWi? TeT " CT * 

c**.»»»c» g »j«eTeT|eTe ? AA|T ? g WW ^ g ^ t ^3eg^»|^ tecTTT6T 

ACeA.ACAAC^eTe^Ag^gTg.TgCAJgA^ 
AAAAATCCTgg m ggJg4ggAeAA»e«*^£T ? g I ^ 

TAA A AAACTCgTA,gg Jggg Jgjg^ggg Jgg^g Jgg q§£ A? Xffrl? T ATC CTA , 

S *C 



— 605 — 



(24) 



ftm¥4-2 1 13 7 5 



[06] 



[08] 




»T1S8 0o<0 Iflb 




RikJ 



Barrel amiewaja^mxumxnxK 
Fokl 



,Ukb 



I Barrel 
CI • I 



Lire* Irogient 
U kb 



J 



Dbterted : Rtl-Aip-Ala-BU-Lri-Sir-Clii-Yal- 
Pradlcled i Uct-Asp*AU-flll-Lyi-air-ClQ-Vil- 



fl CUt SD JkLtoAliH.UiSr 

MQt tc« ogtaMMoao t • l|CWTT A5T AAQijAQOTTTAAA ATGSaCQCT £KCaaa too 
it nag tpca 1 1 1 1 teeca 1 mat$Ai 




ili-Bli-Arf-Phe-Lji-Up-Lea-Clr 
m-Bti-Afi-Pha-l?a-A»p-LtQ-Cly 



[010] 



o a X J} 8 1 

LtuVaiLauAlaGlyPn AlaAl >A 1 *S« r Al iGl uTbrA I &At nLy iSa rAtn 

TTCCTTCTCCCACqACCCCCCCCTCCCACTCCri^AACCCCO>UVCAAATCCAAT 

»" i 

c 



BaaHl 



[012] 



[09] 




Ecom 



EcoR! KlndS 

Sail 




pUBIIO LSkb 



pTUBBIAIOt H if* II It* 



HMI11 h*n HUlll 



PIUIBT1 

IT to 




l*tf tW In ill (If M AH, 111 Ua •— - J" * 

I 1 1| ( l*1inmi« Cfl OC pOC C O>COAOCT CCcjOQOAAT TC i f itflQI* 
itltn I t ,icaCCe<HCOCCTC9A80cJcCCTTAAOAOCT 



PA0IC1 

II tt 



as- ■-7i?-* 
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1 1 3 



II 3] 



[014] 



Not 



AGCTT6T AATG ATAGC OOCCOCOO AGTOCTOAC6CTC AC 

ACATTACTATCGCCOO CGCCT CAC9ACTQCGAGTQTTTA 



Hlndlll t*b»*.l». 



Hindi I.| 1 1 ■ B I 



Foki cohitlti 




JJ IAI1CI tUCIINtA 
19 Mb if Ml 



Is 



" — r 



» Mail n 

Ml lM Ala *fr fn kU. Um Um ft* Ati* 
•II •«■■■•«■••• iv C6A0T OCT a ACeCTC AC* ■ ■«•;••••• 
CGCTC ACQA CTQCfl AO T 0T T T Af • 





□ uia 



117] 



[015] 



2$(M25^6A^32 1^ 8 A 2 ^1 0 ' toflr\Jxo\h*5 



Slandarts • # • 



lAhr 


• • 


♦* • 




• 


16 hr 


• f> 


• • 


• • • 


• 


18hr 


• • 




• • • 


• 




t 2 


3 4 5 6 


12 3 A 


5 








Id* 5 





Hlndm 
Kph! 
, Sail 
Pstt 

Xbal 
SpN 

Barrel 



100 200 300 400 500 



123 



A5 



678 



PKSAE2 
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[01 6] 



AACCTTGGCA 
G AAA ACTTC A 
CAGCACCACC 
CAATCT6CA0 
CTTCCCACCC 
CAACCCAACC 
CCTCCCCAAC 
AAATACCTCT 
TTCCCTAAAC 
TOCCTCCTOC 
CCTCTOAAAT 
CCCCCCCTCT 
OACCTOACCA 
COTCCCCACC 
C A-A»T 0 C T C C C 
CAAATCCCCO 
A A AAACTAC 6 

ccACACcecc 

CTGQA AAA AT 
TTCAAACCGC 
CACCTCGCTO 
CACOTCTCTA 
rCCTCCAAAC 
CTGAACCAGC 
TCCACCGAAT 
TACCTACCCA 
TCCGAAAAAC 
CCCAAACCAA 
AAATOCTGCA 
OCTCCOTCTC 



TOG ACCCTCA 
AAGCCCTCCT 

ttaaactcct 

AAAACTGCOA 
TCCCCCAAAC 
AATCCTTCCT 
TCGACCTAAT 
A C C A AATCCC 

cttacaaacc 
coaaactcoa 

GCOCTTCTCT 
CCCACCOCTT 
AAOTTCACAC 
TCCCOAAATA 
AAAAA&CGCT 
CGCATtfTOCC 
CAGAAGCAAA 
ACT ACTCTCT 

cctgcccacc 
tggtto aac a 
aatacaaatt 

CTCCGACCCT 
ACCCCCAACC 
TCTGCCTTCT 
CTCTGCTTAA 
AAGAATTCAA 

aaccccagat 
ccaaaoaaCa 

AAGCTOACGA 
AOGGTCGACT 



CAAATCCGAA 
TCTQ ATCGCT* 
AAACCAAOTA 
CAAATCCCTO 
CTACOGTCAA 
TCAGCACAAA 
GTGCACCGCA 
ACCCCCTCAC 
AGCTTTCACT 
CC A AC TGCCT 
CCACAAATTC 
CCCOAAAGCA 
CCAATOCTCC 
CATCTCCGAA 
GCTCCAAAAA 
OTCTCTCCCG 
ACACOTAtTC 
CGTTCTGCTB 
CCC.TCACCCA 
ACCOCAGAAC 
CCACAACGCT 
G GTCGAAOTA 
CAAACGTATC 
CC A CO A AAA A 
CCGCCCTCCO 
COCAOAAACC 
C A AAAAAC AG 
ACTGAAAGCC 
CAAACAAACC 
CCGCCTOTAA 



CTTCCGC ACC 6TTTTAAAGA 

TTCGCTCAGT "AC CTTCAGC A 
ACCGA ATTCG * CTAAAACCTC 

CACACC9TGT TCOOTOACAA 

ATCOCTGACT CCTGCCCTAA 

C ACC AC AAC C CCAACCTCCC 

TTCCACGACA ACCA AGAAAC 

CCGTACTTCT ACGCaCCGGa 

CAATGCTCCC ACOCGCCTGA 

CACCAAGCTA ACCCGTCTT-C 

GCTCA ACGTG CATTCAAACC 

CAATTCGC AG AAOTGTCTAA 

CACOGCGACC TTCTAGAGTfl 

AACCACCACT CCATCTCTTC 

TCTCACTGCA TCCCAGAACT. 

GCTCACT.TCG TTGAATCAA* 

CTAGCTATGT TCC>GTACOA 

CTCCOCCTGG CAAAAACCTA 

CACCAATCCT ACOC AAA ACT 

CTCA-TCAAAC AGAACtCCCA 

CTGCTCCTTC OCTACACCAA 

TCCCOTAACC TOGCTAAACT 
CCCTOCGCCO— AAGACTACCT 

ACCCCCCTTT CTGaCCGTOT 

TGCTTCTCCC CTC7AGAGCT 

TTCACTTTCC ACGCGGACAT 
ACCGCTCTGO TCCAACTG6T 

CTGATGGACO ACtTCCCAGC 

TGCTTCGCTO AAGAAGCTAA 

tGATAGGATC C I* 



CCtCGCTCAC 
CTCCCCGT7C 
CCTACCTGAC 
ACTGTGCACT 
ACAGOAACCG 
CCCCCTOGTT 
CTTCCTCAAA 
ACTCCTCTTC 
CAAACCGCCa 
TGCAAAACAC 
GTOOGCAPTT 
ACTGCTTACT 
CGCACACOAC 
TAAACTC AA A 
AG AAA ACG AC 
AGACGTGTGC 
ATACGCTCCT 
CCAAACTACC 
GTTCCACOAA 
A CTCTTC CAA 
AAACGTACCO 
T GGCTCT AAA 
GTCCCTGCTO 
AACTAAATCe 
TGACGAAAGC 
CTCCACCCTC 
AAAACACAAA 

tVtcgtagaa 
aaaactgot a 



mi 9] 



[02 01 



Hindi II 




pTUSftco) 12)6 

reim 

|ap V iB^m 



Fokl 



B*<nHl| CBK^JOM^O^htMnOiCaaiK J Ban* I 



Li roast fragMnt 
| Ufcb 



Larue fraoaent 
, 16 kb 



H "Clfll SD JtaJpAlaHsba&r 

Bftflttc»cgt*at88fl00ti tfcCMTA(TTW6GAWmAAAATGGAmTOT«Mtc 
ttcMotnc»tttt1cccatiBc!rAATCAnCCTCCAAATTTTACCTCCCA6T0nTA| 



Not I 



AOCTTGT AAT8ATAGC OOCCOCGOAQTOCTO ACG CTC AC 

ACATTACTATCCCCGG CCCCTC ACGACTOCOAGTQTTTA 



Hindi II oohrslva 



Rind I'll lull 





pH$AE2 r^F^kl 

4.ltt Ftkl 



IFohl 



Larfltit frt|Mni 
■ LI Kb 



Hindi 1 1 v«ot l 



FUC33H3AE 
19kb 



Cta I 




pSDHSAE \Z 
14* 



& bHJ 
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[01 8] 



To y * I 

WCTTCCCATCC^CCCTUCJUATC CO^CTOCCCACCCTTTTAAACACaCCCKACfi W 

AceaAarcwACTrcnAGCcncAAcaira^ 

lACTTCAAACCCCTGCTTTrCATCCCTnCCCTCACTACCTTCiCCAn^CCCCnC 
TCCCCACtAACAmGCaUiUtt^ 

CUCCnAAACTCCTAAACC AArTAACCGAATTOCCTA AAACCTCCCTACCTCACCAATCTCCA 

TICACtATTTCCntATTCGCnAACCGATTTTl^^ P,tr 



Wbl 

CCTCCTCCCCAAACTCCACCAACTCCCTC ACT AAGCT AACGCCTCTTCTCCAAAA 
CTACCCACCACa^GACaCCTT^CCaCTCmCamCCCACAACA Ul 1 1 lU CCCAC 

UCCCTCTCAMTCCCCTTnCTCCACAMTTCCnCAACCTCCATTCAAACC 

ACTTTACCCCAAC AGAOCTCTIT AAWCACTTCUCCTAACTTTCCCACCCCTC 

crccciarrrcaccccTCTcecAttc^ 

AACCttCaACACCCTCCCCAAOKXTrTCC^ 

GTC1 AAiCTCCnACTCAC CTCACCAMCnCA UCCt MTCCTCCCACCCCCACCTT . Xbtl 
ACCAATt^TCWCTCCmCMCTCTOXTT.W^ 



Hit 

' ■ ^ACTCCGAC^TCCCTGaCACeCTCTTH^ 
ACGTCTTTTGACGCTOTnAGCCACCTGTtG^ 

CAAAeCTACCCTCJUATUOTCAXTCCTTCtnMACAGl^ 
CGCACSCCCTTTUATCCCACTTT&CCCACTCACCACCCCATTTtrrcCT^ 

TCACCACAAACACCACAACCCtAACCTCCCCtCCCTCCnCCTCCCCAAC lilf 
ACCAACCAA(ntC7CnTCTtXT(rnJHUjCnCCACCW 



<UI 

TCCACCTAATCTCCACCCCATTCCAOCACAACCAAC AAA CCTTCCTGA AAAAATAC 

CUTTACACCTCCCCTAACCTCtlGl 1UCI ICnTCGAACGACTTTTTTATCOACATCa 

CTmca.\ATCCCACCCCCTCACCCCTAmaACGCACCCCAACTnaCrrCTTOC- 
TTACCCT^UCTCCCaTCAAaTCCCTCCCCTTWCaCAACA^ 

nAAACmACAAACCACCmCAnCAATCnCCCACCCCCCTCAI^CCCCCATC 

ATCmCCTOA.«CT«CTTJU^CCTCOCCCCACTCTTTCCCC SphI 



Xkil 

CTAGAGTCaCAaCaCCCTOBWrtCttCAAATAU^ 
TUCCCCTCTGCTCCCACCCCTtXACCCCTnATn 



CTCTTCTAAACnWUlACAATCCTCC CAAAAACCC CTC 

TTCACTTTCTTACCACCCTTITrGttEACCACti i tTTACACTCACCTAXCC ICUCATCTTTT 

mCAAAAKACCAMTCCCKCC^TCTCCCCKTCT^ • 
«TCCXTTAC66CCCCCTACACC«WACACCCCCCACTCAA6CAAm 

6TCTCCAAAAACTACGC AC AAC CA^AlUC CTATTCCTACCTATCTTtCTtTA CCAATACPCTCC Silt 
TTTGAT6DGTCTT UJ 1 1 It l O CATAAJOATCaTACAACaCATCmATCCCACaca 



. en* a 



OTCSOC C TC AT G AC A C AC C A AC ACG ACCA CG fcGG A CCCTTT T TC G ATGCT.TTC ATGC C ACCTTTT 

C TCCA A AA AT CC TCCC CACCCCCTCACCCaCACCAATCCTACCC AA AAOTOTTCOACC A A 

• Ti CG ACCCOTCCCCCACTGCCYCTGCTTaCCATCCCTTTTCACAAGCTCCTTAACTTTCC 

TTe "" e iSI?gIIi«S^8S?H?«sm*«n««miSgi4«2SIIgg^ 0 Tcc«ec 

"'"'cGCAOCTCTCT^CTCCCACCCTCCTQe A ACT ATC CCCTAaCCTCCCT AAA6TTCGCT 

catcccctccacacatgaccctcccaccaccttcataScccattccacccatt 

TC*Aeco*gJ*«IggIgg*«g»ggggg*?ggg«4ggI«Iggg?Igggggg^g4gI$"" Teec 

« e At J »eccIggIggIgWgg«gI?IgggIIgJgg*gg«««ggggggIIIgIg«ggg T « T ""* 

«*TTo»7«Xg8Igg«gg*«glfIggn«8gg8ggJggg«S]KgIggggI.*Tc "•«• 
ttsi JggIJg«8««8I*88«88g«W8$«I8f*888M*M88I/i«To*»Afl 

e "*""fg»«gggIgIg?Iig*«IggI«m*g4*tggg*^gg4 T eeTTTCT 

""""*g«gIS4«ggg?Ig«gg*gg*gIIgigtggIIIg?I«TTTTT»eC 
T "*""Igg^ggIggilgIgJgggIgg«IggiggIgI«Ig«?gcT«0 ..«„« 
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SSWiElJ 

[Mje^*«s«] mm 

0Tfc*. 

[021 t hJlLfll7^5 >©N*«f*jfi»cm-©SIK 
S^«Stt£8At3fcaicE@LfcF o k I BMM 

[031 fie^^wsifssjSgsfiroisss^-rB-e* 

30©4>mW^Dyi'©ffi«. S.tfpHSA]&t#8 

[04 A) 0 4 Alifc htkft7)l'~7S. >SH5iH83S©;fc 
D N A$SE«t^l/fcD N A t U y 



£ 1 £j*T0T&£. 
[04 B) 04Blit hJfrfll7JK/5>»£T«*©fc 
ft. D N A^MT^bfcDN At 'J ^V-oyn 7 

* 2 £^-T0T*S. 

[04 CI 04CttfchJto»7;^S>«£7S!!B©fc 
». DNA^»T^l/&DNA*iJ^T-O^D7 
£3£t*1-0T&S. 

[04D1 04Dttt hJflfi!7;V^5>»^f*a©fc 
ft. DNA-&^«T-&fiKL/&DNA^U^T-©^n 5 ' 

[04 El 04 EHk hJH^7J^5 >ffle^ffi«©fc 
», DNA^fflT^lfcDNAtU^V-oyDy 

* 5 S^-T0T?feS. 

[04 Fl 04 F«t \~lkm7)V7S. >»£T8l£©fc 
». D N A6««-C^L.fcD NA t 'J dT-O^Dy 
£ 6 &7jVr0T&*>. 

[04 Gl 04 G«t hJkflj7;^5 >ffl&F»*©fc 
DNA^fiaT^UfcDNAtUdT-CD^Dy 

* 7 Sr^T0-e»S. 
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[04H] S4H«b:Mfilfi7^^>ae^#}^0^ 
^8&7jVr0T*So 

[05] *»bfcfchJhfll7^S>ft*»«©5»B^ 
ttttf S * fc^fi L, fc^&^D N A £tk-TEIT 
*So ft*, SDI1 U^/-A«g£$ffifcgTo 

[06] isirct hii7;^5>€*HTSit 

5^7^^HpSDHSA4 (Dtt^HS^fHT?* 
So ft*. ^7X5 HpTI 3S (Nco) fit *SS® 
t r p^n^E-^— *«lfi«(0^773 KT&So 
[0 7] #U7^U;V7^ \*9a&WlBXXtVx.X9> 
^Dyr-f >yHT?»*. SWHfcSB^Si (A) ttSD 
8*07*0*73 PBJUMHfc ^-^^-^T 
*>;WJt*ftftl/fcHT*5. (B) fit (A) © 
1/3 0iOie*«^TH«otft»lli»» WW© 

5/T-f>^l/:0?^^ B*©1. 2, 3, M<Z>tt# 
1*&T<&jI9T*S. 1. HB10 1£M#SG2. H 
B101/pSDHSAE124I#ie3. HB 1 0 

i/psdhsaei 2m&m#M. frm^-jj- 

[0 8] «B«ttS6©73/BEMft«*BTft*, 
Observed fi**Bfc«*;£n&EaU Predi 
c t edtt?£UA:e^^T. 

[0 9] ->t hM?*-pBU4 3 7 10«W*t 
0T&So 

[010] a-75 9— tf©5MBfcfi«-r*0»IWlC 

(A 1 a 3 3) #iK07S/BEWfttfDNAEM«« 
"T0T*So ft*, ^WJi, Al a33Sa-mE 
^GCTfrfcGCCKB&rSEiiKlckoTNo t I 

To 

[011] »»^^^-**O^J&tCfPHUfc^riEDN 
A£?SfBT*S. 
[012] »^?-PASEC l©i®iaT*S 8 
[013] pASEC 1 IC»«t*t»Ot hJfil»7;V 

[014] t h«H7Jk/3 >*tt«rWHBT*fca 
C0^X5 H P AMY 3 3 H S A 4 O>fll£g0T*So 

[015] 1 A 5 1 0/pAMY 3 3HS A4 Sfcte 1 
A 5 1 0 /p AMY 3 3HSAE2RIX1 A 5 1 0/p 
BU4 3 7 10«tM, 16, 1 8ttBB4>tt£-kni 
1 /* 1 £ 10X1X5 @:MD>;A hU 

at hms7;k/s >a*T^ □ 7^ >^ufc0T& 

So Standardslt SI GMAOE s s e n t 
ial globlinfree HUMAN Alb 
urn i n*ffll>&. B r o t hOfi:Ei:lt 
yhhfco HtCl. 2, 4, 5 OttBlCtt 1 A 5 1 0 
/p AMY 3 3HSA4£fcfcJ:l A 5 1 0/pAMY3 



3HSAE2^ 3. 6(^ffiS«C«l A5 1 0/pBU 

4371 **n*nx*y Mfc. 

[016] *56W«S#RfrU 
L&, fchMrn^^ta-PTftDNAEHftS 
T0T&S. 

[01 7] »e^+o®liBBSIS5fi[^Eg^^T0T$> 
So ft*. XBJS^o y ^ l d>6yn 7? 8 M©fll8 
t. 3^© + WttyoyjrO«WU WpHSAE2# 

[01 8 A] 018 Afitt hJfiL»7;k^5 >Sfc^»Sg 
CD&», DNA-&J*«7?^rtbfcDNA*y=fT-©^ 

[0 1 8 b] 0 1 8 Bttt hmT)i>?$ >m&=?m& 

n^2£^T0T**o 
[0 1 8 C] 0 1 8 Cfitfc hMtTAf?* 
©fcft, DNA^j£«T^£L£DNA*U:JV-CD:7 
n*/^3^T0T?*So 

[01 8 D] 0 1 8 Dfctt hit* 7/K/5 >aer*s 
(Dfctf), DNA^ar^b^DNA^Urf^-O^ 
n^^4^"T0T&So 

[0 1 8 E] 0 1 8 Ettt hJW»7^S>Be?«B 
Ofc*. DN A^iT^flEU&DN AtU ^T-O^ 
n^dr 5&5j^~0T$>So 

[0 1 8 F] 018 Fl*t hJfc»7JV:/5>Be?«IB 
<Dfc&, DNA*ja«T£rtLfcDNA*U:f^-0:/ 
D*;J 6£^~0"C&S. 

[0 1 8 g] a 1 8 Gttt hjfefS7j^5 >3ie^«t6 

©fcafc, DNA-&*itft«l/fcDNA*«J.^-0? 
n-y^7«*-TBT»S. 

[01 8 H] 01 8H«thlfilfll7;^5>ae : ?«|^ 
CD£», DNA-&*«T-&J*UfcDNA*U =rsr-©^ 

[01 9] *BUfcth*«7^ys>**B«T«a 
"TS^XS HpSDHSAEl 2<D«^Jg^"T0 
T*So ft*> ^7X5 FpTl 3S (Nco) fit, * 
litrp yD^-^-S:^aoy7X^ FT* 

So 

[020] pasec i fc«>rrsfc»©t hlM»7;V 

^5>»e?Oflll60T*So 

[0 2 1] thJk*7J^>fc*WBT»»'r*fc* 
X5HpAMY33HSAE2 OSI&0T&S. 

[02 2] *«L&thJMIl7;^5>**BBT5W 
f ^^7X5 HpKT9 lHSAE4©«jg?jei£^T 
0T»So 

PN8*tIE2] 

[«iBtt*®e*] £0 

HtiEfttH £S 
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I' AACCTTGGOA TCCACCCTCA CAA ATCCG A A CTTCCCCACC OTTTTAAACa CCTCCCTCaC 

G AAAACTTC A AAGCGCTOGT TCTGaTCCCT' TTCGCTCAOT ACCTTCACCA CTCCCCCTTC 

CACGACCACG TTAA ACTCCT AAACGAAGTA aCCC AATTCC - CTAAAACCT6 CCTaGCTCAC 

OAATCTOCAG AAAACTCCCA caaatcccto CACACCCTCT TCCCTCACAA ACTGTCCACT 

CTTCCGACCC TGCCCCAAAC CTACGGTGAA ATCGCTGACT 'CCTGCGCTAA ACAQOAACCC 

GAACCCAACC AaTOCTTCCT TCAGCACAAA CaCGACAACC csaacctccc gcccctcctt 

CGTCCGCAAC TCCACCTAAT CTGCACCfrCA TTCCACGACA ACGAAGAAAC CTTCCTGAaa 

a a atacctgt ACCAAATCCC ACCCCCTCAC CCGTACTTCT acccacccca actcctgttc 

t tcgctaaac CTTaCAAACC AGCTTTCACT CAATGCTCCC AGCCGCCTGA CAAACCGCCA 

TGCCTCCTGC CGAAACTCGA CCaaCTCCGT CACCAAGCTa ACCCCTCTTC TCCAAAACAC 

COTCTOAAAT CCOCTTCTCT CCAGAAATTC CGTGA AC GTO CATTCaAAGC CTCCCCAMT 

CCGCGCCTGT CCCACCCCTT CCCCAAaCCa GAATTCCCAC AA6TCTCTAA ACTCCTTACT 

CACCTCACCA AACTTCACAC CGAATGCTGC cagcgcgacc ttctacactg cccagaccac 

CCTCCCCACC TC C CC A A A T A CATCTCCCAA AACCAGC ACT CCATCTCTTC TAAACTGAAA 

CAATGCTCCC A A A A AC-CO C T CCTGGAAAAA TCTCACTCCA TCCCACAACT- ACAAAACGAC 

C a AATGCCGG CGGATC*TCCC CTCTCTGGCG CCTCACtTCC TTCAATCAAA acacgtgtgc 

AAAAACTACC CACAACCAAA A CACGT A + TC CTaGGTaTGT TCCTCTACGA ATACGCTCCT 

CGACACCCGG ACtACrCTCT GCTTCTGCTC CTGCCCCTCC CAAAAACCTA CCAAACTaCC 

ctcg A AAAAT CCTOCCCAGC ccctcaccca CACCAATCCT ACGCAAAAGT GTTCGACCAA 

TTCAAAGCGC tcgttcaaca AGCCCACAAC CTCA-TCAAAC AGAACTGCCA ACTGTTCAAA 

CACCTGGCTG AATACA AATT CCAGAACOCT CTC-CTGGTTC CCTACACCAA AAAGGTACCC 

CAGCTCTCTA CTCCCACCCT CCTCGAAGTA TCCCCTAACC TG6GT AA ACT TCCCTCT AAA 

TOCTOCAAAC .ACCCOOAACC CAAACCTATC CCGTCCGCCC-AAGaGTACCT CTCCCT0CT6 

CTC A AC CACC TCTCCCTTCT CCaCGAAAAA ACCCCC-OTTT CTCaEGCTGT AACTAAATCC 

TCCACCGA AT CT CTGGTT A A CCGCCCTCCC TCCTTCTCCG CTCTAGaOGT TOACGAAACC 

TACCTACCCA AaOaATTCAA CCCAGAAACC TTCACTTTCC ACQCCCACAT CTGCACCCTC 

TCCGAAAAAG AACGCCAOAT CAAAAAACAC ACCCCTGTOO TCOAACTOCT AAAACACAAA 

ccgaaagcaa ccaaacaaca actgaaaocg gtoatccacc acttggcagc tTtcctacaa 
a a atgctgca aacctcacga caaaqaaacc tgcttccctc aagaaco**a aaaactccta 

OCTOCOTCTC ACCCTGCACT CGGCCTCTAA TGATAGGATC C J" 



[02] 

I 

A»pAI«Hi«LytScrClu7«tAI a H I iAr rPh t Ly i AipL tuC I yG I u 

aaccttgccatcgaccctcac |aaat ccoaacttccccacccttttaaacacctccctcac 
ttccaaccctacctcccactcttta|cccttcaacccctgccaaaatttctccacccactc 
fojci = 1 = » ■ a n fi i s 



[03] 



[04 A] 



100 



20O 



300 



coo 



500 



Hindin 
Kptf 
Sail 
PstI 



Xbal 
SpW 



HiodC 

ACmWXATCCACCaCACAAATCCCAACTTCCGCACCCTmWACACCTCC^ 
ACCCTACCTCCGACTCTTTACCCTTCAACGCGTCGCAAAATTTC^ 

AACTTCAAAGCCCTCCnTCTCATCGCTTTCG^ 

TCCCCACCAAGACTACCGAAAiXCACTCATCCAAGtt^ 

CCACaTAAACTGGTAAACCAACrrAACCGAAnCCn"AAAA 

nCAaATTTGCTTCAnCGCTTAAXCGAnnGCACGCATCCACTCCTTAG PslI 



[04 B] 



2 

123 



45 



pHSA 



PstI 

GaaaACTGCGACAWTCCCTG CAC ACCCTGTTCCCTCAC AAACTCTO^ . 

ac uc i i U GACGCTtrmAMUca(rrGCGACAACCCACTtrrTrGAaam;ACAACca 

GAAACCTACGGTtUAATGGCTtUCTGn'GCCCTAAAW 
GK ACGCGCTTTGGArGC C ACT7T ACCG ACTGACCACG CGATnCTCClTGGCCTTGCGTrGCTT 

TCAGCACAAACACCACAACCCGAACCTGCCCCGCCTGGTTCCTCCGGAAG St It 
AXGAA£GAACTCCTCTnCTGCTCnGCGCTTCGACCtCGCCGACCAA^ 
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[04 C] 



[04 D] 



UXl 

TCCACCTAATCTCCACCCCAnCCACCACAACCMCAAACCnCCTCAAAAAATAC 
CCATTACAanKCCTMGCUCTGTTGCTTC^ 

aCTACGAAATCCCACCCCGTaCCCCTACTTCTACS^^ 
nAGCGTCMCCACTGGCCArCAAGATGC^ 

CTAAACCTTACAMGCACCTTTCACTCAAKCTCCCAGCCCCCTCACAAACCCCCATC 

ATCTTrCCTCC.VAACTWCTTACCACCCTCCCCCCACTCTTTCCCC Spht 



7Qv9 i 
Sphl 

CCTCCTGCCGAAACTGUCCMCTaGTGACG>AGGT^ 

CTACCWcaccccrrrcAiXTCcmAacACTccnamccccAOiACAC(rrmGTCCc^ 

CACQnCTCAAATCCCCnCTCTCCACAMnCCCTCAACCTCCATTCAAAGC 

AcmAcacwCACACCTcmwacAmcaccTAAcmcccACCcac 

CTWGtAGTT^CGCCTtTTCCAIXC CTTCCK AA^ 

AACWCCCCACACCC7CCCCAA(XXfCTnCCTCnAACCGTCT^ 

nCTAAACTCCmaCACCTWCCAAAlTICACACaUATCCTimca Xbal 
ACCAATCACTCCAaCCnTCAACTCTGCmACCACCCTCrCCCTKAAGATC 



[04E] 



Xbit 

CTAWCTCCCCACACCACCCTCCCCACCTWCauTACATCTCCCAAMCCACCAC^ 
KACCCCTaCCTOMUCCCCTCUCCCCmATCTACACCCTTTTGCTCCTW 



CTCnCTAAACTCAAACAATCCT< ^AAAAACC GCTCCTC CAAAAA TCTC*CTCCATCCCACAA 

TTCACTTTCTTACCACCti m ivCCGACCACCTTTnACACTCACGTACCCTCTTCATCTTTT 

GTAWAAAmCCAAATCCCCGCCGATCTttCGTCT^ 

CCTCCTTTACCCCCCC CTACACCCC AG^ACCCCCCACTCMCCAACnACTTTTCTCCACACCTT 

GTCTECAAAAACTACGCAGMGCAAAACACGTATTCCTAGGTATt^ Si 1 1 

TTTCATCCCTCTTCCTTTTCTCCATAACtATCCATACAACCACATCC^ 



[04 F] 



ACCtTTT 
ACTTTCC 



CTCCACCt 



[04 G] 



c:;cS?^SiISIiI*SIggSiS^I??IS?n?»I8iS e cI«SiI^?I? A T* CTTCCCT 
„. tM » f »»IUXK«SS8ttX£I«HSII**SSJSHSa8IKI«5888iX««e 



[E34H] 



CT:i «8II8tSSM«SI*gfI*Sg?^^IIS»g?i*i?«S?I.cTc»»« 

' e *8*8g88ISIi8I88«*SI88I?«*«8««S8«t88*To 6 TTT t T 
" c *^t»iIE}^fl88I8«88*a8«II§|g«!XIIg?I??TTTrT« 6 
* Tec I8g«*?gI8«8*g*«8«*ggX8iII888i8^?4?i8ATiTTTTc 
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[05) 



SD 



Met Asp AU Hb 



5' CGATTACTAAGGAGGTTTAAAATGGACGCTCAC 

TAATCATTCCTCCAAATTTTACCTGCGAGTGTTTA 



CI a I cohesive 



Fok 1 eohtaive 



[07] 

♦ (A) (B) 
M12 3M Ml 23 M 



S4K — 
67K — 
A3K — 

30K — 

20K — ' 



— 67K 




Fok 



BarrH 1 1 CWJCWTOOCmMMMOflCOWC 

Fok I 



Largest fraycarrt 
■ ,V8kb 



pTIISWco) 12* 

fCtoTj 

ap \.rjra] 

I BarrH I 
ICIat 



Large Iregwnt 
j t«kb 



+1 Cta 1 SD JttA»AUHial»i_Ser_ 

i.ottc*c B ti»aaaoootit|MAnA6TAAGGAMTnAAAAT6CAKETettit.lct 
ttcaiotocatintcccata^3TAATCAnCCTCCAAAmTACClCC6A0T6TrTAr- 




BmwH\ 



[08] 

Obitrte* : llet-Aa»-AU-Hii-Lyi-S«r-Cla-¥al- 
PredUted : ttet-A*MI«-HU-tji-Sir-CN-Val- 



Ala-m*-Arf-Pbe-tn-A«p-Lto-Clr 
Ali-Blt-Ari-Pbe-ty*-asp-Leu-Cl F 



[09] 




frSK527 3.3 kb 



EcoRl 




EcoRI_HlndBl 

, Sal! 




pUBIIt 43kb 



p»U4J7f 

7.7 kb 



[010] 



23 a » » a| 

LtuVi tleuAl iClyProAl*AlaAliSerAI aCi uThrAl aAanLrsSerAen 
TTCCTTCTCGCACCACCCCCCCCTCCCACTCCtCAAACCCCCAACAAATCCAAT 

C 
Km i 
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[013] 



[HI 4] 
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[015] 



[01 8 A] 



ng/ji 

250 12 5 64 32 16 8 4 2 1 0 broth broth' 5 

^SVi > \ J I ( ( ( 

Standards 

Uhr • • • • • • 

• • • • • 



HiodC 

AttTnraATCCACGCTCACrOfTOXAAr^ 

ACCCT ACCTCCCAirrCTTTACGCTTCAACCCCTCttAAAArrTCT^ 

AACTTWAACCCCTCCHCTCATCttTTTCCCTCACTA 

TCCCWCCAAWCTACCCAAAGCGACTCATKAACT^^ 

CCAC CTI AAACTCCTAAACCAACT AAC CCAArrCtXTAAAACCnXCTACCTCACC AATCTCCA 

nCACtATimTTamOTAACCCATrmCACCaTttACTtCTTAC PstI 



16 hr • • 
18hr • • 



12 3 4 S 6 1 2 3 4 5 6 



mi 6] 



• AAGCTTCGGA 
OAAAACTTCA 

caccaccacc 

OAATCTCrAC 
CTTCCCACCC 
GAACCCAACC 
CCTCCCC AAC 
AAATACCTCT 
TTCCCTAAAC 
TCCCTCCTCC 
CCTCTO A AAT 
CCCCCCCTCT 
CACCTGACCA 
CCTCCCCACC 
C A-A»T O C T G C G 
GAAATGCeOG 
A A A A ACTACG 
CCACACCCCC 
CTCCAAAAAT 
TTCAAACCGC 
CAGCTGGGTG 
CACOTGtCTA 
TCCTCCAAAC 
CTG AACGAGC 
TGCACCGAAT 
TACCTACCCA 
TCCGAAAAAG 
CCC AAACCA A 
AAATCCTGCA 
CCTCC6TCTC 



TCCACCCTCA 
AACCCCTCCT 
TTAAACTGGT 
AAAACTGCGA 
TGCGCGAAAC 
AATCCTTCCT 
TCGACGTAAT 
ACCAAATCOC 
OTTACAAACC 
CGAAACTCCA 
GCGCTTCTCT 
CGCACCOCTT 
AACTTCACAC 
TCCCC A AAT A 
AAAAACCGer 
CGCATCTGCC 

cagaaccaaa 

ACTA CTCTGT 
GCTCCGCAGC 
TGGTTCAACA 
AATACAAATT 
CTCCCACCCT 
•ACCC CCAAC C 
TCTCCCTTCT 
CTCTCCTTAA 
AAGAATTCAA 
AACGCCAOAT 
C C AA AC AAC A 
AACCTCACGA 
ACCCTCCACT 



C A A A TCCO AA 
TCTC ATCGCT" 
A AAC GAAGTA 
CAAaTCCCTG 
6TACGCT0AA 
TC AGCACAAA 
CTGCACCMA 
ACCCCGTCAC 
AGCTTTCACT 
CCAACTGCCT 
CCAGAA ATTC 
CCCGAAAGCA 
CCAATCCTCC 
CATCTCCCAA 
OCT CG A AAA A 
GTCTCTCGCG 
AC ACGT A TTC 
CCTTCTCCTC 
GCCTGACCCA 
ACCCCACAAC 
CCAGAACGCT 
CCTCGAACTA 
GAA ACGTATG 
CC ACCA A AAA 
CCCCCCTCCO 
CCCACAAACC 
CAAAAAACAC 
ACTG AAAGCG 
C A A AC A A A C C 
CCCCCTGTAA 



OTTCCCCACC 
TTCCCTCACT 
ACCC A ATTCG 
CACACCCTCT 
ATCCCTG ACT 
CACCACAACC 
TTCCACCACA 
CCGTACTTCT 
GAATCCTCCC 
CACCAACCTA 
CCTCAACCTC 
CAATTCCCAG 

caccgccacc 

AAC C AGO ACT 
TCTCACTOCA 
CCTOACT.TCG 
CTACCTATGT 
CTGCCCCTGG 
CACCAATCCT 
CTGA-TCAAAC 
CTGGTGOTTC 
TCCCGTAACC 
CCCT3CC C.CG- 
ACCCCGCTTT 
TCCTTCTCCC 
TTCACTTTCC 
ACCCCTCTCC 
GTGATCCACC 
TCCTTCCCrC 
tGATAGCATC 



CTTTTAAAGA 
ACCTTCACCA 
CTAAAACCTG 
TCOCTGAC AA 
C CTCCCCTAA 
CGAACCTGCC 
ACCAACAAAC 
ACGCACCGGA 
A CGCCGCTGA 
ACGCGTCTTC 
CATTCAAAGC 
AAGTGTCTAA 
TTCTACACTC 
CCATCTCTTC 
TCGCAGAACT 
TTGAATCAAA 
TCCTCTACCA 
C AAAAACCT A 
ACCC AA AAC r 
AGAACTCCGA 
GCTACACCAA 
T6GCTAA AC f 
.AACACTACCT 
CTGACCCTCT 
CTCTACAGCT 
ACGCCCACAT 
TGGA ACTCCT 
ACTTCGCAGC 
AACAAGCT-AA 
C J* 



CCTGCCTCAC 
CTGCCCGTTC 
C6T AOCTOA C 
ACT G TC C AC T 
ACAGGAACC6 
CCCCCTGGTT 
CTTCCTG AAA 
ACTGCTGTTC 
CAAACCGCCA 
TCC AAAACAC 
GTGGCCAG'TT 
ACTGGTTACT 
CCC ACACGAC 
T A A ACT G AAA 
AC A A A ACGAC 
AGACGTCTCC 
ATAC6CTCCT 
CCAAACTACC 
CTTCCACCAA 
ACTCTTC CAA 
AAACCTACCG 
TCCC TCT a A A 
CTCCGTGGTC 
AACTAAATCC 
TCACCAAACC 
CTCCACCCTG 
A AA AC AC AAA 
TTTCT5T AC A A 
AAAACTGGTA 



II 7] 



II 8B] 



100 



HiAdlD 
Kpnl 
Sail 
Pstl 

Xbal 
SphI 

BanM* 



200 



300 



400 



500 



PtlI CAAAAC7XXCACAAATCCCTCCAaCCCTCnCCC TCACAAACT ^ 

ACPrTmcAcaTCTn-A(mc(^trrB(»Acwa 

<LUACCTACC<rrCAMT-SC^ 
c^CCCCCTTTaATCCCACmACC^CTGACaCCaATTTCTCCm 



TCACCACAJWCACCACWCCCCAACCTCaCCWCT(OTCCTCCCCAAC 
ACCAAGCAAC 



StlC 



2 
123 



45 



7 

678 
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[01 8 C] 



CHI 8D] 



TCGACCTAATCTOACCCCATTC^M^^ 

CCATTACACCTWCCTAACCUCTCTTCCTTCTTTCCAACCAC i i 1 1 i lATCGACATGCT ■ 

nCTACCAMTCOACWCCTCA^ 

nASCXrrW^ACTCCCCATCMCATCCCTCCCCTTCACCACAA^ 

CTAMCCTTACAAJmiOTCOTCUT^ 



^^^CTCCTCCCCAAACTCGACCWCTCCCTCACCAACCTAACCCCTCrrCT 
CAceCT CTnAlATCCCCTTCTCTCCAC AAATTCCGTGAACCT CCATTCAAAGC 

AACCCCCG^aCWTCCCCMCCCCTnCCTmAAiraCTTaCACATTTC 

CTCTAAACTCmACTCACCTUOAAA^CAt^ ^ 
ACCAATCACTW^CTtOTCAACTCICCCnACCAaiTTCCCCaCCAACATC 



[01 8 E] 



Ibil 

CTACAI^CCCAaACMCCCTGOCTACCTCC^ 

TCAC(OTCTOTGCCACCCa«ACCCCmATrrAGACCCTmCCTCCTGACCTACACM 



TIt*CnTCTT*CCACCCTTTTTC5CaCC*Ctl 1 1 1 IACACTCACGTACCGTCTTCATCTTTT 

nmt**M*TOaiCAC*ACC*iWAC«CT»nccTACCT*TCnca(TACCMT*ccacc un 



II 8F] 



niA 848S8884SX4SI8I8I?8XIgXgSIggXggggSXig8444448gX4S&444SX4SgcceTTTT 
" CCAAAA 4XggIgggS48§88SXg A 88S4&48g44X8&X4g884444gI8XXgg4S844.AeTTTec 
""* ACC i8I88II8**8?*S888f8?*88I'8«8^«*8 A 48X888t*8I8II8?*4oT C oxccc 
CA6 " c "XS44l48 A 44II8S48MSg8IgI§8I88XI8ggX484g«f44 A 8 etAC M .« 



[01 8G] 



c»Tc888488I8I8I A 8I888 A 888I88I88i A 8I A I88g8I4 A 88I888I AAAC 

TCAACC 



0 *8I AAA I88IS8 AAA g488888 AA 888 AA 488I A I8888I8888884 A 8 A 8I ACCTCTC " 



ccacacccI 



CTCACCCTCTAACTA 



88I88I8*488 A 88I8I88?II8I88 A 8? AAAA 4*888888III8I8t888 
T AA I88J88 A 888 A «8I8X88II A *888SS8I888I88II8I8888Io»Te "* r 

m 1 8 H] 



^M»ggrig A gg A 4 A T88X A 88I$888 AAA 8 AA II8 AA 888 A 8 AAA 88I*cTc»*.e 
"*"" c 8488888 a 8 a X8X88 a 888X8I888' a ' aa 8 aa 8888 a 84Xcttttttc 

CAAMAACA^gCJJJJggT^AAeTgJTAA.^ 
«CC»AACAAgAAgtgAAA.ggggTgA ; ggACG»gJTC^ 

AAAAAT6C I88n A 88I8t88 A 8 AAA 84 AA 88X88XX8l8X8 AA 8 AA 88ArTTTTTc 
TAAAAAAe X88X A 88J888X8I84888J88 A 8I88888I8X AA X8 A X48crAc 
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il 9] 



[020] 



pHSAH2 4.5* 

Hin dill 




FoKI 




Not J 



AGCTTCTAATOATACC GGCCGCOCAOTGCTOACaCTCAC 

ACATTACTATCGCCGG CGCCTCACGACTCCGAOTGTTTA 



H inilii cchnfv, 



HI Nil I li.KI 



Fokl 



BarrH I 
Fokl 



cGOTAOTX*cwamAAMTOKanac | BamH I 
rmtKTncmMim*ajoaunernk Clot 



Largest f raiment 
IBkb 



L 



Large fngnant 
i 18 kb 



* CU f SO Jbt^AlaHIalwScr 

aafittcacBla3aea M atat|^TTAGTAAGGAC<imAmT<MiAC0^CAC«MuH 
tl etas tsc« 1 1 1 1 1 ccce t a g3f AATCAnCCTCCAA ATTTTACCTGCGAGTGTTTAg 




4.4J* 




pUCIt 



«i n* I II 
I a ■ H i 



Fokl ceh««i vi 




<3kb Fokl 



I Fokl 
| 3u»Hl 



L»r B »*t frofstnt 
I LI kb 



Hln« I 1 1 w^»o t I 



fUC33H^Ae 
Ok* 




BauHl 



[02 1] 



1.1 » 



P UC35HSAE 

4.S kb 
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[022] 




(51)Int.Cl. 5 




CI 2R 


1:19) 


(CI 2N 


1/21 


C12R 


1:125) 


(CI 2P 


21/02 


CI 2R 


1:19) 


(CI 2P 


21/02 


CI 2R 


1:125) 


(CI 2P 


21/02 


CI 2R 


1:08) 



m\m^ st ftgas^ f i &.m&*®m 
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